Plants resistant to C strains of cucumber mosaic virus

ABSTRACT

Coat protein genes of cucumber mosaic virus strains V27, V33, V34 and A35 (CMV V27, CMV V33, CMV V34, and CMV A35 respectively) are provided.

This application is a continuation-in-part of U.S. Ser. No. 08/367,789, filed on Dec. 30, 1994, now abandoned.

FIELD OF THE INVENTION

This invention relates to coat protein genes derived from cucumber mosaic virus strains V27, V33, V34, and A35 (CMV V27, CMV V33, CMV V34, and CMV A35, respectively). More specifically, the invention relates to the genetic engineering of plants and to a method for conferring viral resistance to a plant using an expression cassette encoding V27, V33, V34, or A35 strains of cucumber mosaic virus.

BACKGROUND OF THE INVENTION

Many agriculturally important crops are susceptible to infection by plant viruses, particularly cucumber mosaic virus, which can seriously damage a crop, reduce its economic value to the grower, and increase its cost to the consumer. Attempts to control or prevent infection of a crop by a plant virus such as cucumber mosaic virus have been made, yet viral pathogens continue to be a significant problem in agriculture.

Scientists have recently developed means to produce virus resistant plants using genetic engineering techniques. Such an approach is advantageous in that the genetic material which provides the protection is incorporated into the genome of the plant itself and can be passed on to its progeny. A host plant is resistant if it possesses the ability to suppress or retard the multiplication of a virus, or the development of pathogenic symptoms. "Resistant" is the opposite of "susceptible," and may be divided into: (1) high, (2) moderate, or (3) low resistance, depending upon its effectiveness. Essentially, a resistant plant shows reduced or no symptom expression, and virus multiplication within it is reduced or negligible. Several different types of host resistance to viruses are recognized. The host may be resistant to: (1) establishment of infection, (2) virus multiplication, or (3) viral movement.

Cucumber mosaic virus (CMV) is a single-stranded (+) RNA plant virus that has a functionally divided genome. The virus genome contains four RNA species designated RNAs 1-4. RNAs 3 and 4 encode the coat protein which is a protein that surrounds the viral RNA and protects the viral RNA from being degraded. Only RNAs 1-3 are required for infectivity because the coat protein, which is encoded by RNA 4, is also encoded by RNA 3.

Several strains of cucumber mosaic virus have been classified using serology, host range, peptide mapping, nucleic acid hybridization, and sequencing analyses. These CMV strains can be divided into two groups, which are designated "WT" (also known as subgroup I) and "S" (also known as subgroup II). The S group consists of at least three members. The WT group is known to contain at least 17 members.

Expression of the coat protein genes from tobacco mosaic virus, alfalfa mosaic virus, cucumber mosaic virus, and potato virus X, among others, in transgenic plants has resulted in plants which are resistant to infection by the respective virus. Heterologous protection can also occur. For example, the expression of coat protein genes from watermelon mosaic virus-2 or zucchini yellow mosaic virus in transgenic tobacco plants has been shown to confer protection against six other potyviruses: bean yellow mosaic virus, potato virus Y, pea mosaic virus, clover yellow vein virus, pepper mottle virus, and tobacco etch virus. However, expression of a preselected coat protein gene does not reliably confer heterologous protection to a plant. For example, transgenic squash plants containing the CMV C coat protein gene, a subgroup I virus, which have been shown to be resistant to the CMV C strain are not protected to the same degree against several highly virulent strains of CMV: CMV V27, CMV V33, CMV V34, and CMV A35 which are all subgroup I viruses.

Thus, a need exists for plants resistant to CMV V27, CMV V33, CMV V34, and CMV A35.

SUMMARY OF THE INVENTION

This invention provides: an isolated and purified DNA molecule that encodes the coat protein for the V27 strain of cucumber mosaic virus (CMV V27), and a chimeric expression cassette comprising this DNA molecule; an isolated and purified DNA molecule that encodes the coat protein for the V33 strain of cucumber mosaic virus (CMV V33), and a chimeric expression cassette comprising this DNA molecule; and an isolated and purified DNA molecule that encodes the coat protein for the V34 strain of cucumber mosaic virus (CMV V34), and a chimeric expression cassette comprising this DNA molecule; and an isolated and purified DNA molecule that encodes the coat protein for the A35 strain of cucumber mosaic virus (CMV A35), and a chimeric expression cassette comprising the DNA molecule. Another embodiment of the invention is exemplified by the insertion of multiple virus gene expression cassettes into one purified DNA molecule, e.g., a plasmid. Each of these cassettes also includes a promoter which functions in plant cells to cause the production of an RNA molecule, and at least one polyadenylation signal comprising 3' nontranslated DNA which functions in plant cells to cause the termination of transcription and the addition of polyadenylated ribonucleotides to the 3' end of the transcribed mRNA sequences, wherein the promoter is operably linked to the DNA molecule, and the DNA molecule is operably linked to the polyadenylation signal. Preferably, these cassettes include the promoter of the 35S gene of cauliflower mosaic virus and the polyadenylation signal of the cauliflower mosaic virus 35S gene.

Also provided are bacterial cells, and transformed plant cells, containing the chimeric expression cassettes comprising the coat protein genes derived from the CMV V27, CMV V33, CMV V34, or CMV A35 strains, and preferably the 35S promoter of cauliflower mosaic virus and the polyadenylation signal of the cauliflower mosaic virus 35S gene. Plants are also provided, wherein the plants comprise a plurality of transformed cells containing the chimeric coat protein gene expression cassettes derived from the CMV V27, CMV V33, CMV V34, or CMV A35 stains, and preferably the cauliflower mosaic virus 35S promoter and the polyadenylation signal of the cauliflower mosaic virus gene. Transformed plants of this invention include tobacco, beets, corn, cucumber, peppers, potatoes, elons, soybean, squash, and tomatoes. Especially preferred are members of the Cucurbitaceae (e.g., squash and cucumber,) and Solanaceae (e.g., peppers and tomatoes) family.

Another aspect of the present invention is a method of preparing a CMV-resistant plant,.such as a dicot, comprising: transforming plant cells with a chimeric expression cassette comprising a promoter functional in plant cells operably linked to a DNA molecule that encodes a coat protein as described above; regenerating the plant cells to provide a differentiated plant; and identifying a transformed plant that expresses the CMV coat protein at a level sufficient to render the plant resistant to infection by the specific strains of CMV disclosed herein.

As used herein, with respect to a DNA molecule or "gene," the phrase "isolated and purified" is defined to mean that the molecule is either extracted from its context in the viral genome by chemical means and purified and/or modified to the extent that it can be introduced into the present vectors in the appropriate orientation, i.e., sense or antisense. As used herein, the term "chimeric" refers to the linkage of two or more DNA molecules which are derived from different sources, strains or species (e.g., from bacteria and plants), or the linkage of two or more DNA molecules, which are derived from the same species and which are linked in a way that does not occur in the native genome. As used herein, "expression" is defined to mean transcription or transcription followed by translation of a particular DNA molecule.

BRIEF DESCRIPTION OF THE DRAWINGS

FIGS. 1(A-B). The nucleotide sequence of the coat protein gene of cucumber mosaic virus V27 [SEQ ID NO:1]. The deduced amino acid sequence of the encoded open reading frame is shown below the nucleotide sequence [SEQ ID NO:2].

FIGS. 2(A-B). The nucleotide sequence of the coat protein gene of cucumber mosaic virus V33 [SEQ ID NO:3]. The deduced amino acid sequence of the encoded open reading frame is shown below the nucleotide sequence [SEQ ID NO:4].

FIG. 3. The nucleotide sequence of the coat protein gene of cucumber mosaic virus V34 [SEQ ID NO:5]. The deduced amino acid sequence of the encoded open reading frame is shown below the nucleotide sequence [SEQ ID NO:6].

FIGS. 4(A-D). The alignment the nucleotide sequences of the coat protein genes from 5 CMV strains [SEQ ID NOS:1, 3, 5, 9, and 10]. Ccp and Cmvw1[SEQ ID NO:9 and 10] are described in Quemada et al. (J. Gen. Virol., 70, 1065 (1989)). Alignments were obtained with the use of the UWGCG Pileup program. The dots represent either the lack of sequence information at the 5' end of the coat protein gene or gaps in homology in sequences relative to others in the alignment. The positions of primers RMM351 and RMM352 are shown [SEQ ID NOS:7 and 8].

FIGS. 5(A-B). The alignment of the amino acid sequences deduced from the nucleotide sequences of CMV strains V27, V33, V34, CMV-C (shown in FIG. 4 [SEQ ID NO:1, 3, 5, 9 and 10]) and CMV strain Cmvq3 (Quemada et al., J. Gen. Virol., 70, 1065 (1989)) [SEQ ID NO:2, 4, 6, 11 and 12]. Alignments were performed by the UWGCG Pileup program. Differences among the "C" type viruses are underlined and highlighted with asterisks. The dots represent gaps in homology in sequences relative to others in the alignment.

FIGS. 6(A-D). (A) Assembly of CMV strain V27 coat protein expression cassette. PCR products of CMV V27 were installed into pCRII and subsequently inserted into pUC18cpexpress by routine methods. The bolded lines and arrows which are a part of the circle represent CaMV 35S sequences. (B) Insertion of a CMV V27 coat protein expression cassette BamHI fragment into the BglII site of pEPG204 and pEPG205 to produce pEPG239 and pEPG240, respectively. (C) Restriction map of pEPG239. This binary plasmid includes the coat protein expression cassettes for PRV (melon, long), CMV V27, ZYMV, and WMVII. For further information on PRV coat protein genes, refer to Applicants' Assignees copending patent application Ser. No. 08/366,881 entitled "Papaya Ringspot Virus Coat Protein Gene" filed on Dec. 30, 1994, incorporated by reference herein. For further information on ZYMV and WMVII coat protein genes, refer to Applicants' Assignees copending patent application Ser. No. 08/232,846 filed on Apr. 25, 1994 entitled "Potyvirus Coat Protein Genes and Plants Transformed Therewith", incorporated by reference herein. (D) Restriction map of pEPG240. This binary plasmid includes the coat protein expression cassettes for PRV (melon, short), CMV V27, ZYMV, and WMVII.

FIGS. 7(A-D). (A) Assembly of CMV strain V33 coat protein expression cassette. PCR products of CMV V33 were installed into pUC1318cpexpress by routine methods. (B) Insertion of a CMV V33 coat protein expression cassette BamHI fragment into the BglII site of pEPG204 and pEPG205 to produce pEPG196 and pEPG197, respectively. (C) Restriction map of pEPG196. This binary plasmid includes the coat protein expression cassettes for PRV (melon, long), CMV V33, ZYMV, and WMVII. Arrows indicate CaMV 35S promoter fragments. (D) Restriction map of pEPG197. This binary plasmid includes the coat protein expression cassettes for PRV (melon, short), CMV V33, ZYMV, and WMVII.

FIG. 8. The nucleotide sequence of the coat protein gene of cucumber mosaic virus A35 [SEQ ID NO:14]. The deduced amino acid sequence of the encoded open reading frame is shown below the nucleotide sequence [SEQ ID NO:15].

FIGS. 9(A-B). The alignment of the amino acid sequences deduced from the nucleotide sequences of the six CMV strains shown in FIG. 10 [SEQ ID NO:2, 4, 6, 11, 12 and 15]. Differences among the "C" type viruses are underlined and highlighted with asterisks. The dots represent gaps in homology in sequences relative to others in the alignment.

FIGS. 10(A-K). The alignment the nucleotide sequences of the coat protein genes from 6 CMV strains [SEQ ID NOS:1, 3, 5, 9, 10 and 14]. The dots represent either the lack of sequence information at the 5' end of the coat protein gene or gaps in homology in sequences relative to others in the alignment.

DETAILED DESCRIPTION OF THE INVENTION

Cucumber mosaic virus (CMV) is a single-stranded (+) RNA plant virus that has a functionally divided genome. The virus genome contains four RNA species designated RNAs 1-4; 3389 nucleotides (nt), 3035 nt, 2193 nt, and 1027 nt, respectively (Peden et al., Virol., 53, 487 (1973); Gould et al., Eur. J. Biochem., 126, 217 (1982); Rezaian et al., Eur. J. Biochem., 143, 227 (1984); Rezaian et al., Eur. J. Biochem. 150, 331 (1985)). Only RNAs 1-3 are required for infectivity (Peden et al., Virol., 53, 487 (1973)) because the coat protein, which is encoded by RNA 4, is also encoded by RNA 3. Translations of CMV RNAs yield a 95 kD polypeptide from RNA 1, a 94 kD polypeptide from RNA 2 (Gordon et al., Virol., 123, 284 (1983)), and two polypeptides from RNA 3: its 5' end encodes a 35 kD polypeptide, and its 3' end encodes a 24.5 kD polypeptide (Gould et al., Eur. J. Biochem., 126, 217 (1982)). The 24.5 kD polypeptide is identical to that encoded by RNA 4 and is the coat protein.

Several strains of cucumber mosaic virus have been classified using serology, host range, peptide mapping, nucleic acid hybridization, and sequencing. These CMV strains can be divided into two groups, which are designated "WT" (also known as subgroup I) and "S" (also known as subgroup II). CMV subgroup I includes CMV-C, CMV-V27, CMV-V33, CMV-V34, CMV-M, CMV-O, CMV-Y, and CMV-A35 while subgroup II includes CMV-Q, CMV-WL, and CMV-LS (Zaitlin et al., Virol., 201, 200 (1994)). Protection against a strain in one group does not necessarily provide protection against all strains in that group. For example, transgenic squash plants protected with coat protein genes from the CMV strain C are not protected against the CMV strains V27, V33, V34, or A35. In addition, Zaitlin et al. (Virol., 201, 200 (1994)) report that tobacco plants transgenic for a CMV-FNY replicase gene show protection against challenge from subgroup I strains but show no protection against challenge from subgroup II challenges. Thus, the present invention is directed to providing plants with resistance to CMV strains V27, V33, V34, and/or A35.

To practice the present invention, a viral gene must be isolated from the viral genome and inserted into a vector. Thus, the present invention provides isolated and purified DNA molecules that encode the coat proteins of the V27, V33, or V34 strains of CMV. As used herein, a DNA molecule that encodes a coat protein gene includes nucleotides of the coding strand, also referred to as the "sense" strand, as well as nucleotides of the noncoding strand, complementary strand, also referred to as the "antisense" strand, either alone or in their base-paired configuration. Thus, a DNA molecule that encodes the coat protein of the V27 strain of CMV, for example, includes the DNA molecule having the nucleotide sequence of FIG. 1 [SEQ ID NO:1], a DNA molecule complementary to the nucleotide sequence of FIG. 1 [SEQ ID NO:1], as well as a DNA molecule which also encodes a CMV coat protein and its complement which hybridizes with a CMV V27-specific DNA probe in hybridization buffer with 6×SSC, 5× Denhardt's reagent, 0.5% SDS and 100 μg/ml denatured, fragmented salmon sperm DNA and remains bound when washed at 68° C. in 0.1×SSC and 0.5% SDS (Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd ed. (1989)). Moreover, the DNA molecules of the present invention can include non-CMV coat protein nucleotides that do not interfere with expression of the CMV coat protein gene. Preferably, the isolated and purified DNA molecules of the present invention comprise a single coding region for the coat protein. Thus, preferably the DNA molecules of the present invention are those "consisting essentially of" DNA that encodes the coat protein.

These CMV genes are used to produce the coat proteins, which are believed to confer resistance to viruses. Another molecular strategy to provide virus resistance in transgenic plants is based on antisense RNA. As is well known, a cell manufactures protein by transcribing the DNA of the gene encoding that protein to produce RNA, which is then processed to messenger RNA (mRNA) (e.g., by the removal of introns) and finally translated by ribosomes into protein. This process may be inhibited in the cell by the presense of antisense RNA. The term antisense RNA means an RNA sequence which is complementary to a sequence of bases in the mRNA in question in the sense that each base (or the majority of bases) in the antisense sequence (read in the 3' to 5' sense) is capable of pairing with the corresponding base (G with C, A with U) in the mRNA sequence read in the 5' to 3' sense. It is believed that this inhibition takes place by formation of a complex between the two complementary strands of RNA, thus preventing the formation of protein. How this works is uncertain: the complex may interfere with further transcription, processing, transport or translation, or degrade the mRNA, or have more than one of these effects. This antisense RNA may be produced in the cell by transformation of the cell with an appropriate DNA construct arranged to transcribe the non-template strand (as opposed to the template strand) of the relevant gene (or of a DNA sequence showing substantial homology therewith).

The use of antisense RNA to downregulate the expression of specific plant genes is well known. Reduction of gene expression has led to a change in the phenotype of the plant: either at the level of gross visible phenotypic difference, e.g., lack of anthocyanin production in flower petals of petunia leading to colorless instead of colored petals (van der Krol et al., Nature, 333:866-869 (1988)); or at a more subtle biochemical level, e.g., change in the amount of polygalacturonase and reduction in depolymerization of pectin during tomato fruit ripening (Smith et al., Nature, 334:724-726 (1988)).

Another more recently described method of inhibiting gene expression in transgenic plants is the use of sense RNA transcribed from an exogenous template to downregulate the expression of specific plant genes (Jorgensen, Keystone Symposium "Improved Crop and Plant Products through Biotechnology", Abstract X1-022 (1994)). Thus, both antisense and sense RNA have been proven to be useful in achieving downregulation of gene expression in plants, which are encompassed by the present invention.

The CMV coat protein gene does not contain the signals necessary for its expression once transferred and integrated into a plant genome. Accordingly, a vector must be constructed to provide the regulatory sequences such that they will be functional upon inserting a desired gene. When the expression vector/insert construct is assembled, it is used to transform plant cells which are then used to regenerate plants. These transgenic plants carry the viral gene in the expression vector/insert construct. The gene is expressed in the plant and increased resistance to viral infection is conferred thereby.

Several different methods exist to isolate a viral gene. To do so, one having ordinary skill in the art can use information about the genomic organization of cucumoviruses to locate and isolate the coat protein gene. The coat protein gene is located near the 3' end of RNA 3. Using methods well known in the art, a quantity of virus is grown and harvested. The viral RNA is then separated by gel electrophoresis. A cDNA library is created using the viral RNA, by methods known to the art. The viral RNA is incubated with primers that hybridize to the viral RNA and reverse transcriptase, and a complementary DNA molecule is produced. A DNA complement of the complementary DNA molecule is produced and that sequence represents a DNA copy (cDNA) of the original viral RNA molecule. The DNA complement can be produced in a manner that results in a single double stranded cDNA or polymerase chain reactions can be used to amplify the DNA encoding the cDNA with the use of oligomer primers specific for viral sequences. These primers can include novel restriction sites used in subsequent cloning steps. Thus, a double stranded DNA molecule is generated which contains the sequence information of the viral RNA. These DNA molecules can be cloned in E. coli plasmid vectors after the additions of restriction enzyme linker molecules by DNA ligase. The various fragments are inserted into cloning vectors, such as well-characterized plasmids, which are then used to transform E. coli and create a cDNA library.

CMV coat protein genes from previously isolated strains can be used as hybridization probes to screen the cDNA library to determine if any of the transformed bacteria contain DNA fragments with sequences coding for a CMV coat protein. Alternatively, plasmids which harbor CMV coat protein sequences can be determined by restriction enzyme digestion of plasmids in bacterial transformants. The cDNA inserts in any bacterial colonies which contain this region can be sequenced. The coat protein gene is present in its entirety in colonies which have sequences that extend 5' to the sequence which encodes the ATG start codon and sequences that extend 3' of the stop codon.

Alternatively, cDNA fragments can be inserted in the sense orientation into expression vectors. Antibodies against the coat protein can be used to screen the cDNA expression library and the gene can be isolated from colonies which express the protein.

In the present invention, the DNA molecules encoding the coat protein (CP) genes of the cucumber mosaic virus strains V27, V33, V34, and A35 have been determined and the genes have been inserted into expression cassettes. These expression cassettes can be individually placed into a vector that can be transmitted into plants, preferably a binary vector. Alternatively, two or more of the CMV CP genes can each be present in an expression cassette which can be placed into the same binary vector, or any of the CMV CP expression cassettes of the present invention can be placed into a binary vector with one or more viral gene expression cassettes. The expression vectors contain the necessary genetic regulatory sequences for expression of an inserted gene. The coat protein gene is inserted such that those regulatory sequences are functional and the genes can be expressed when incorporated into a plant genome. For example, vectors of the present invention can contain combinations of expression cassettes that include DNA from a heterologous CMV coat protein gene (i.e., one from another CMV isolate), papaya ringspot virus coat protein gene, a zucchini yellow mosaic virus coat protein gene, and a watermelon mosaic virus-2 coat protein gene.

Moreover, when combinations of viral gene expression cassettes are placed in the same binary plasmid, and that multigene cassette containing plasmid transformed into a plant, the viral genes all preferably exhibit substantially the same degrees of efficacy when present in transgenic plants. For example, if one examines numerous transgenic lines containing two different intact viral gene cassettes, the transgenic line will be immune to infection by both viruses. Similarly, if a line exhibits a delay in symptom development to one virus, it will also exhibit a delay in symptom development to the second virus. Finally, if a line is susceptible to one of the viruses it will be susceptible to the other. This phenomenon is unexpected. If there were not a correlation between the efficacy of each gene in these multiple gene constructs this approach as a tool in plant breeding would probably be prohibitively difficult to use. Even with single gene constructs, one must test numerous transgenic plant lines to find one that displays the appropriate level of efficacy. The probability of finding a line with useful levels of expression can range from 10-50% (depending on the species involved). For further information refer to Applicants' assignees copending patent application Ser. No. 08/367,788 entitled "Transgenic Plants Expressing DNA Constructs Containing a Plurality of Genes to Impart Virus Resistance" filed on Dec. 30, 1994 (now abandoned), incorporated by reference herein.

In order to express the viral gene, the necessary genetic regulatory sequences must be provided. In the present invention, the coat protein genes are inserted into vectors which contain cloning sites for insertion 3' of the initiation codon and 5' of the poly(A) signal. The promoter is 5' of the initiation codon such that when genes are inserted at the cloning site, a functional unit is formed in which the inserted genes are expressed under the control of the various genetic regulatory sequences.

The segment of DNA referred to as the promoter is responsible for the regulation of the transcription of DNA into mRNA. A number of promoters which function in plant cells are known in the art and can be employed in the practice of the present invention. These promoters can be obtained from a variety of sources such as plants or plant viruses, and can include, but are not limited to, promoters isolated from the caulimovirus group such as the cauliflower mosaic virus 35S promoter (CaMV 35S), the enhanced cauliflower mosaic virus 35S promoter (enh CaMV35S), the figwort mosaic virus full-length transcript promoter (FMV35S), and the promoter isolated from the chlorophyll a/b binding protein. Other useful promoters include promoters which are capable of expressing the cucumovirus proteins in an inducible manner or in a tissue-specific manner in certain cell types in which the infection is known to occur. For example, the inducible promoters from phenylalanine ammonia lyase, chalcone synthase, hydroxyproline rich glycoprotein, extensin, pathogenesis-related proteins (e.g. PR-1a), and wound-inducible protease inhibitor from potato may be useful.

Preferred promoters for use in the present CP-containing cassettes include the constitutive promoters from CaMV, the Ti genes nopaline synthase (Bevan et al., Nucleic Acids Res. II, 369 (1983)) and octopine synthase (Depicker et al., J. Mol. Appl. Genet., 1, 561 (1982)), and the bean storage protein gene phaseolin. The poly(A) addition signals from these genes are also suitable for use in the present cassettes. The particular promoter selected is preferably capable of causing sufficient expression of the DNA coding sequences to which it is operably linked, to result in the production of amounts of the proteins or RNA effective to provide viral resistance, but not so much as to be detrimental to the cell in which they are expressed. The promoters selected should be capable of functioning in tissues including, but not limited to, epidermal, vascular, and mesophyll tissues. The actual choice of the promoter is not critical, as long as it has sufficient transcriptional activity to accomplish the expression of the preselected proteins or their respective RNAs and subsequent conferral of viral resistance to the plants.

The nontranslated leader sequence can be derived from any suitable source and can be specifically modified to increase the translation of the mRNA. The 5' nontranslated region can be obtained from the promoter selected to express the gene, an unrelated promoter, the native leader sequence of the gene or coding region to be expressed, viral RNAs, suitable eucaryotic genes, or a synthetic gene sequence. The present invention is not limited to the constructs presented in the following examples.

The termination region or 3' nontranslated region which is employed is one which will cause the termination of transcription and the addition of polyadenylated ribonucleotides to the 3' end of the transcribed mRNA sequence. The termination region can be native with the promoter region, native with the gene, or can be derived from another source, and preferably include a terminator and a sequence coding for polyadenylation. Suitable 3' nontranslated regions of the chimeric plant gene include but are not limited to: (1) the 3' transcribed, nontranslated regions containing the polyadenylation signal of Agrobacterium tumor-inducing (Ti) plasmid genes, such as the nopaline synthase (NOS) gene; and (2) plant genes like the soybean 7S storage protein genes.

Preferably, the expression cassettes of the present invention are engineered to contain a constitutive promoter 5' to its translation initiation codon (ATG) and a poly(A) addition signal (AATAAA) 3' to its translation termination codon. Several promoters which function in plants are available, however, the preferred promoter is the 35S constitutive promoters from cauliflower mosaic virus (CaMV). The poly (A) signal can be obtained from the CaMV 35S gene or from any number of well characterized plant genes, i.e., nopaline synthase, octopine synthase, and the bean storage protein gene phaseolin. The constructions are similar to that used for the expression of the CMV C coat protein in PCT Patent Application PCT/US88/04321, published on Jun. 29, 1989 as WO 89/05858, claiming the benefit of U.S. Ser. No. 07/135,591, (now abandoned) filed Dec. 21, 1987, entitled "Cucumber Mosaic Virus Coat Protein Gene", and the CMV WL coat protein in PCT Patent Application PCT/US89/03288, published on Mar. 8, 1990 as WO 90/02185, claiming the benefit of U.S. Ser. No. 07/234,404, (now abandoned) filed Aug. 19, 1988, entitled "Cucumber Mosaic Virus Coat Protein Gene."

Selectable marker genes can be incorporated into the present expression cassettes and used to select for those cells or plants which have become transformed. The marker gene employed may express resistance to an antibiotic, such as kanamycin, gentamycin, G418, hygromycin, streptomycin, spectinomycin, tetracyline, chloramphenicol, and the like. Other markers could be employed in addition to or in the alternative, such as, for example, a gene coding for herbicide tolerance such as tolerance to glyphosate, sulfonylurea, phosphinothricin, or bromoxynil. Additional means of selection could include resistance to methotrexate, heavy metals, complementation providing prototrophy to an auxotrophic host, and the like.

The particular marker employed will be one which will allow for the selection of transformed cells as opposed to those cells which are not transformed. Depending on the number of different host species one or more markers can be employed, where different conditions of election would be useful to select the different host, and would be known to those of skill in the art. A screenable marker such as the β-glucuronidase gene can be used in place of, or with, a selectable marker. Cells transformed with this gene can be identified by the production of a blue product on treatment with 5-bromo-4-chloro-3-indoyl-β-D-glucuronide (X-Gluc).

In developing the present expression construct, i.e., expression cassette, the various components of the expression construct such as the DNA molecules, linkers, or fragments thereof will normally be inserted into a convenient cloning vector, such as a plasmid or phage, which is capable of replication in a bacterial host, such as E. coli. Numerous cloning vectors exist that have been described in the literature. After each cloning, the cloning vector can be isolated and subjected to further manipulation, such as restriction, insertion of new fragments, ligation, deletion, resection, insertion, in vitro mutagenesis, addition of polylinker fragments, and the like, in order to provide a vector which will meet a particular need.

For Agrobacterium-mediated transformation, the expression cassette will be included in a vector, and flanked by fragments of the Agrobacterium Ti or Ri plasmid, representing the right and, optionally the left, borders of the Ti or Ri plasmid transferred DNA (T-DNA). This facilitates integration of the present chimeric DNA sequences into the genome of the host plant cell. This vector will also contain sequences that facilitate replication of the plasmid in Agrobacterium cells, as well as in E. coli cells.

All DNA manipulations are typically carried out in E. coli cells, and the final plasmid bearing the cucumovirus expression cassette is moved into Agrobacterium cells by direct DNA transformation, conjugation, and the like. These Agrobacterium cells will contain a second plasmid, also derived from Ti or Ri plasmids. This second plasmid will carry all the vir genes required for transfer of the foreign DNA into plant cells. Suitable plant transformation cloning vectors include those derived from a Ti plasmid of Agrobacterium tumefaciens, as generally disclosed in Glassman et al. (U.S. Pat. No. 5,258,300), or Agrobacterium rhizogenes.

A variety of techniques are available for the introduction of the genetic material into or transformation of the plant cell host. However, the particular manner of introduction of the plant vector into the host is not critical to the practice of the present invention, and any method which provides for efficient transformation can be employed. In addition to transformation using plant transformation vectors derived from the tumor-inducing (Ti) or root-inducing (Ri) plasmids of Agrobacterium, alternative methods could be used to insert the DNA constructs of the present invention into plant cells. Such methods may include, for example, the use of liposomes, electroporation (Fromm et al., Proc. Natl. Acad. Sci. USA, 82, 824 (1984)), chemicals that increase the free uptake of DNA (Paszkowski et al., EMBO J., 3, 2717 (1984)), DNA delivery via microprojectile bombardment (Klein et al., Nature, 327, 70 (1987)), microinjection (Crossway et al., Mol. Gen. Genet., 202, 179 (1985)), and transformation using viruses or pollen.

The choice of plant tissue source or cultured plant cells for transformation will depend on the nature of the host plant and the transformation protocol. Useful tissue sources include callus, suspension culture cells, protoplasts, leaf segments, stem segments, assels, pollen, embryos, hypocotyls, tuber segments, meristematic regions, and the like. The %issue source is regenerable, in that it will retain the ability to regenerate whole, fertile plants following transformation.

The transformation is carried out under conditions directed to the plant tissue of choice. The plant cells or tissue are exposed to the DNA carrying the present viral gene expression cassette(s) for an effective period of time. This can range from a less-than-one-second pulse of electricity for electroporation, to a two-to-three day co-cultivation in the presence of plasmid-bearing Agrobacterium cells. Buffers and media used will also vary with the plant tissue source and transformation protocol. Many transformation protocols employ a feeder layer of suspended culture cells (tobacco or Black Mexican Sweet Corn, for example) on the surface of solid media plates, separated by a sterile filter paper disk from the plant cells or tissues being transformed.

Following treatment with DNA, the plant cells or tissue may be cultivated for varying lengths of time prior to selection, or may be immediately exposed to a selective agent such as those described hereinabove. Protocols involving exposure to Agrobacterium will also include an agent inhibitory to the growth of the Agrobacterium cells. Commonly used compounds are antibiotics such as cefotaxime and carbenicillin. The media used in the selection may be formulated to maintain transformed callus or suspension culture cells in an undifferentiated state, or to allow production of shoots from callus, leaf or stem segments, tuber disks, and the like.

Cells or callus observed to be growing in the presence of normally inhibitory concentrations of the selective agents are presumed to be transformed and may be subcultured several additional times on the same medium to remove nonresistant sections. The cells or calli can then be assayed for the presence of the viral gene cassette, or can be subjected to known plant regeneration protocols. In protocols involving the direct production of shoots, those shoots appearing on the selective media are presumed to be transformed and can be excised and rooted, either on selective medium suitable for the production of roots, or by simply dipping the excised shoot in a root-inducing compound and directly planting it in vermiculite.

In order to produce transgenic plants exhibiting viral resistance, the viral genes must be taken up into the plant cell and stably integrated within the plant genome. Plant cells and tissues selected for their resistance to an inhibitory agent are presumed to have acquired the selectable marker gene encoding this resistance during the transformation treatment. Since the marker gene is commonly linked to the viral genes, it can be assumed that the viral genes have similarly been acquired. Southern blot hybridization analysis using a probe specific to the viral genes can then be used to confirm that the foreign genes have been taken up and integrated into the genome of the plant cell. This technique may also give some indication of the number of copies of the gene that have been incorporated. Successful transcription of the foreign gene into mRNA can likewise be assayed using Northern blot hybridization analysis of total cellular RNA and/or cellular RNA that has been enriched in a polyadenylated region. mRNA molecules encompassed within the scope of the invention are those which contain viral specific sequences derived from the viral genes present in the transformed vector which are of the same polarity as that of the viral genomic RNA such that they are capable of base pairing with viral specific RNA of the opposite polarity to that of viral genomic RNA under conditions described in Chapter 7 of Sambrook et al. (1989). Moreover, mRNA molecules encompassed within the scope of the invention are those which contain viral specific sequences derived from the viral genes present in the transformed vector which are of the opposite polarity as that of the viral genomic RNA such that they are capable of base pairing with viral genomic RNA under conditions described in Chapter 7 in Sambrook et al. (1989).

The presence of a viral gene can also be detected by immunological assays, such as the double-antibody sandwich assays described by Namba et al., Gene, 107, 181 (1991) as modified by Clark et al., J. Gen. Virol., 34, 475 (1979). See also, Namba et al., Phytopathology, 82, 940 (1992). Cucumovirus resistance can also be assayed via infectivity studies as generally disclosed by Namba et al., ibid., wherein plants are scored as symptomatic when any inoculated leaf shows veinclearing, mosaic or necrotic symptoms.

Seed from plants regenerated from tissue culture is grown in the field and self-pollinated to generate true breeding plants. The progeny from these plants become true breeding lines which are evaluated for viral resistance in the field under a range of environmental conditions. The commercial value of viral-resistant plants is greatest if many different hybrid combinations with resistance are available for sale. The farmer typically grows more than one kind of hybrid based on such differences as maturity, color or other agronomic traits. Additionally, hybrids adapted to one part of a country are not adapted to another part because of differences in such traits as maturity, disease and insect tolerance. Because of this, it is necessary to breed viral resistance into a large number of parental lines so that many hybrid combinations can be produced.

The invention will be further described by reference to the following detailed examples. Enzymes were obtained from commercial sources and were used according to the vendor's recommendations or other variations known in the art. Other reagents, buffers, etc., were obtained from commercial sources, such as Sigma Chemical Co., St. Louis, Mo., unless otherwise specified.

Most of the recombinant DNA methods employed in practicing the present invention are standard procedures, well known to those skilled in the art, and described in detail in, for example, in European Patent Application Publication Number 223,452, published Nov. 29, 1986, which is incorporated herein by reference. General references containing such standard techniques include the following: R. Wu, ed., Methods in Enzymology, Vol. 68 (1979); J. H. Miller, Experiments in Molecular Genetics (1972); J. Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd ed. (1989); and D. M. Glover, ed., DNA Cloning Vol. II (1982).

FIGS. 6 and 7 are presented to illustrate the constructions of this invention.

EXAMPLE I

A. Isolation of CMV RNAs

Zucchini squash plants (20-day old) were inoculated with CMV strains V27, V33, or V34; after 7-10 days, infected leaves were harvested and CMV virus particles were isolated. The procedure used was based on protocols from Lot et al., Annals of Phytopathology, 4, 25 (1972), Francki et al., CMI/AAB Descriptions of Plant Viruses, (July, 1979), and Habili and Francki, Virology, 57, 292 (1974). Approximately 100 g of fresh leaves were extracted in an equal volume (w/v) of 0.5 M Na-citrate (pH 6.5) containing 5 mM EDTA and 100 mL of chloroform. After centrifugation of the extract at 12,000×g for 10 minutes, polyethyleneglycol ("PEG", Sigma Chemical Co. PEG-8000, average molecular weight, Research Grade) was added to the supernatant to a final concentration of 10% and the suspension was stirred for 30-40 minutes at 0-4° C. This suspension was centrifuged at 12,000×g for 10 minutes, and the pellet was resuspended in 40-50 mL of 5 mM Na-borate buffer (pH 9.0) containing 0.5 M EDTA. TRITON X-100 was then added to the the virus particle suspension to a final concentration of 2% and stirred on ice for 30 minutes. This suspension was then centrifuged at 19,000×g for 15 minutes, and the supernatant was collected and subsequently centrifuged at 105,000×g for 2 hours. The virus pellet was collected and resuspended in about 2 mL of 5 mM Na-borate buffer (pH 9.0) containing 0.5 mM EDTA. The resuspended virus preparation was applied onto a step sucrose gradient consisting of 5 layers: 5%, 10%, 15%, 20%, and 25% sucrose dissolved in 2.0 mM Na-phosphate buffer (pH 7.5). Gradients were centrifuged at 37,000 rpm in a Sorvall TH641 swinging bucket rotor for 45 minutes. After centrifugation, the virus band was harvested, the virus preparation was dialyzed against Na-borate buffer, and LiCl was added (2M final concentration) to lyse the virions and to precipitate viral RNA. CMV RNA was dissolved and reprecipitated with ethanol and dissolved in water. By agarose gel electrophoresis, the expected four RNA species were observed.

B. Cloning CMV Coat Protein Genes

(a) CMV V27

The first cDNA strand of CMV V27 was synthesized with the use of Perkin-Elmer RT-PCR kit reagents and the primer RMM352 (shown in FIG. 4, [SEQ ID NO:8]); immediately in the same reaction tube, a polymerase chain reaction (PCR) was carried out with the use of oligonucleotide primers RMM351 and RMM352 (shown in FIG. 4, [SEQ ID NOS:7 and 8] following the manufacturer's protocol. The ATG translation start is included in the NcoI site present in primer RMB51. Individual PCR product molecules were cloned using the TA Cloning™ kit (Invitrogen Corp., San Diego, Calif.) into pCRII (included in the TA Cloning™ kit as a linearized plasmid with single 3' dT overhangs at the ends of the molecule). Three clones were isolated for further study: CMVV27TA21, CMVV27TA23, and CMVV27TA26. With the use of a kit (Sequenase 2 purchased from USB, Cleveland, Ohio), the CMV V27 insert in clone CMVV27TA21 was sequenced.

CMMV27 was compared to 11 different CMV isolates: Cmvbaul, Cmvq3, Cmvw1, Cmvtrk7, Cmvfc, Cmvi17f, Cmvc, Cmvpr50, Cmvv27, Cmvp6, Cmvo, Cmvm, and Cmvy. CMVV27 coat protein is similar to CMV-Y in that it contains a serine at position 29 while other strains have an alanine at this position. However, CMV-Y contains a leucine at position 18 while CMVV27 contains a proline at position 18. In addition, CMVV27 has a methionine at position 206, no other CMV-C group viruses have a methionine at this position (Baulcombe, D., "Mutational analysis of CMV RNA3: Effects on RNA3 accumulation, RNA4 synthesis and plant infection." Unpublished Direct Submission. Submitted (Jun. 19, 1992) David Baulcombe, The Sainsbury Laboratory, Norwich Research Park, Colney Lane, Norwich, NR2 7UH, United Kingdom; Hayakawa et al., Gene, 71, 107 (1988); Hayakawa et al., J. Gen. Virol. 70, 499 (1989); Owen et al., J. Gen. Virol., 71, 2243 (1990); Pappu et al., "The nucleotide and the deduced amino acid sequences of coat protein genes of three Puerto Rican isolates of cucumber mosaic virus." Unpublished (1992). This sequence is included in the GeneBank sequence data base; Salanki et al., "Complete nucleotide sequence of RNA 3 from cucumber mosaic virus strain Trk 7." Unpublished (1993). This sequence is included in the GeneBank data base; Shintaku, J. Gen. Virol. 72, 2587 (1991)).

Cucumber Mosaic Virus Strain V27 was deposited with the American Type Culture Collection (ATCC), 10801 University Blvd., Manassas, Va. on Sep. 15, 1999 and assigned ATCC Deposit Number PTA-658. This deposit was made in compliance with the requirements of the Budapest Treaty that the duration of the deposit should be for thirty (30) years from the date of deposit or for five (5) years after the last request for the deposit at the depository or for the enforceable life of a U.S. patent that matures from this application, whichever is longer. Cucumber Mosaic Virus Strain V27 will be replenished should this virus become non-viable at the depository.

(b) CMV V33

CMV V33 was purified and viral RNA extracted from a virion preparation as described above; subsequently single stranded cDNA was synthesized using Perkin-Elmer RT-PCR kit reagents and oligomer primer RMM352 [SEQ ID NO:8]. The coat protein gene of strain V33 was amplified using PCR as described above for V27 with the use of oligomer primers RMM351 and RMM352 (FIG. 4, [SEQ ID NOS:7 and 8, respectively]). The V33 CP gene PCR product was digested with NcoI and directly cloned into the expression cassette cpexpress installed into pUC1318 (see Kay and McPherson, Nucleic Acid Research, 15, 2779 (1987) for pUC1318; Slightom, Gene 100, 251 (1991) for cpexpress; pUC1318cpexpress is the cpexpress described in Slightom, however it is installed into the HindIII site of the modified pUC plasmid pUC1318 described in detail in Kay and McPherson), rather than into the intermediate vector pCRII. By colony hybridization with a CMV coat protein probe, a number of clones were identified for further analysis: V33cel, V33ce2, V33ce7, and V33ce9. The CMV V33 insert in clone V33ce7 was sequenced with the use of a kit (Sequenase 2 purchased from USB, Cleveland, Ohio).

CMMV33 was compared to 11 different CMV isolates: Cmvbaul, Cmvq3, Cmvw1, Cmvtrk7, Cmvfc, Cmvi17f, Cmvc, Cmvpr50, Cmw27, Cmvp6, Cmvo, Cmvm, and Cmvy. CMVV33 has a serine at position 67 while all other CMV strains compared included a proline at this position. At position 196, both CMVV33 and CMV-Y have a valine residue; all other members of the CMV-C group contains isoleucine at this position. However, at position 184, CMVV33 has an alanine residue while CMV-Y has a threonine residue. Therefore, CMVV33 coat protein is unique (Baulcombe, D., "Mutational analysis of CMV RNA3: Effects on RNA3 accumulation, RNA4 synthesis and plant infection." Unpublished Direct Submission. Submitted (Jun. 19, 1992) David Baulcombe, The Sainsbury Laboratory, Norwich Research Park, Colney Lane, Norwich, NR2 7UH, United Kingdom; Hayakawa et al., Gene, 71, 107 (1988); Hayakawa et al., J. Gen. Virol. 70, 499 (1989); Owen et al., J. Gen. Virol., 71, 2243 (1990); Pappu et al., "The nucleotide and the deduced amino acid sequences of coat protein genes of three Puerto Rican isolates of cucumber mosaic virus." Unpublished (1992). This sequence is included in the GeneBank sequence data base; Salanki et al., "Complete nucleotide sequence of RNA 3 from cucumber mosaic virus strain Trk 7." Unpublished (1993). This sequence is included in the GeneBank data base; Shintaku, J. Gen. Virol. 72, 2587 (1991)).

(c) CMV V34

CMV V34 RNA was prepared as described above. Subsequently, the first cDNA strand was synthesized using CMV V34 template in a reaction that included the following: approximately 2 μg CMV V34 RNA, 1× buffer for Superscript Reverse Transcriptase (supplied by BRL-GIBCO, Grand Island, N.Y.), 2 mM dNTPs, oligomer primer RMM352 (37.5 μg/mL, SEQ ID NO:8), 1.5 μL RNasin, and 1 μL Superscript Reverse Transcriptase (BRL-GIBCO) in a 20-μL reaction. After this reaction was allowed to proceed for 30 minutes, an aliquot of the first strand reaction was used as a template in a polymerase chain reaction to amplify the CMV V34 coat protein gene. The CMV V34 coat protein gene PCR product was cloned into the pCRII vector included in the TA Cloning™ Kit supplied by Invitrogen Corp. Two clones were isolated for further study: TA17V34 and TA112V34. The CMV V34 insert of clone TA17V34 was sequenced with the use of a kit (Sequenase 2 purchased from USB, Cleveland, Ohio). Comparative sequence analysis of the CMVV34 coat protein gene with other CMV coat protein genes (Cmvbaul, Cmvq3, Cmvw1, Cmvtrk7, Cmvfc, Cmvi17f, Cmvc, Cmvpr5O, Cmvv27, Cmvp6, Cmvo, Cmvm, and Cmvy) showed that the CMVV34 coat protein gene is unique (Baulcombe,D. Mutational analysis of CMV RNA3: Effects on RNA3 accumulation, RNA4 synthesis and plant infection. Unpublished Direct Submission. Submitted (Jun. 19, 1992) David Baulcombe, The Sainsbury Laboratory, Norwich Research Park, Colney Lane, Norwich, NR2 7UH, United Kingdom; Hayakawa et al., Gene, 71, 107 (1988); Hayakawa et al., J. Gen. Virol. 70, 499 (1989); Owen et al., J. Gen. Virol., 71, 2243 (1990); Pappu et al., (1992) The nucleotide and the deduced amino acid sequences of coat protein genes of three Puerto Rican isolates of cucumber mosaic virus. Unpublished. This sequence is included in the GeneBank sequence data base; Salanki et al., Complete nucleotide sequence of RNA 3 from cucumber mosaic virus strain Trk 7. Unpublished (1993) This sequence is included in the GeneBank data base; Shintaku, J. Gen. Virol. 72, 2587 (1991)).

C. Engineering CMV Coat Protein Genes

(a) CMV V27

The NcoI fragment in CMVV27TA21 that harbors CMVV27 CP coding sequences was excised from CMVV27TA21 and inserted into the plant expression cassette cpexpress in pUC18 to give CMVV27TA21ce42. The resulting expression cassette was isolated as a partial HindIII fragment and inserted into the binary vector pGA482G [The parent binary plasmid was pGA482, constructed by An (Plant Physiol., 81, 86 (1986)). This binary vector contains the T-DNA border sequences from pTiT37, the selectable marker gene Nos-NPT II (which contains the plant-expressible nopaline gene promoter fused to the bacterial NPT II gene obtained from Tn5), a multiple cloning region, and the cohesive ends of phage lambda (An, Plant Physiol., 81, 86 (1986))] to yield pEPG191 and pEPG192. Subsequently, a PRV coat protein expression cassette was installed to obtain a binary vector that included both CMV V27 CP and PRV CP expression cassettes.

Alternatively, the CMV V27 CP NcoI fragment obtained from CMV V27TA21 was installed into pUC1318cp express (see Kay et al., Nucleic Acid Research, 15, 2779 (1987) for pUC1318; Slightom, Gene 100, 251 (1991) for cpexpress; pUC1318cpexpress is the cpexpress described in Slightom, however it is installed into the HindIII site of the modified pUC plasmid pUC 1318 described in detail in Kay et al.) to give the plasmid CMVV27TA21CE13 (similar to CMVV27TA21ce42). The plasmid pUC1318 provided additional sites (e.g., BamHI and Xbal) with which the cassette could be inserted into the binary vector pGA482G Subsequently, the bacteria-derived gentamicin-(3)-N-acetyl-transferase gene (Allmansberger et al., Mol. Gen. Genet., 198, 514 (1985)) was installed into a SalI site outside of the T-DNA region, adjacent to the left border (B_(L))). The BamHI fragment harboring the CMV strain V27 CP expression cassette was isolated and inserted into the BglII site of the binary plasmid pEPG205 (PRV34/Z72/WMBN22) to give pEPG240 (CMVV27/PRV34/Z72/WMBN22). The BamHI fragment was also installed into the BglII site of the binary plasmid pEPG204 (PRV16/Z72/WMBN22) to yield pEPG239 (CMVV2716/PRV16/Z72/WMN22) (Table 1). For further information on PRV coat protein genes, refer to Applicants' assignees copending patent application Ser. No. 08/366,881 entitled "Papaya Ringspot Virus Coat Protein Gene" filed on Dec. 30, 1994 (now abandoned), incorporated by reference herein. For further information on ZYMV and WMVII coat protein genes, refer to Applicants' assignees copending patent application Ser. No. 08/232,846 filed on Apr. 25, 1994 (now abandoned) entitled "Potyvirus Coat Protein Genes and Plants Transformed Therewith", incorporated by reference herein.

                  TABLE 1                                                          ______________________________________                                         Binary Parental Plasmid                                                                            Site    CMVcp Cassette                                                                            pEPG#                                   ______________________________________                                         pGA482G                                                                               pGA482G      HindIII CMVV27cpexpress                                                                           191 or                                        192                                                                        pPRBN   pEPG204(P16sZW)   BglII    CMVV27cpexpress  239                        pPRBN   pEPG204(P16sZW)   BglII    CMVV27cpexpress  240                        pPRBN   pEPG106(ZW)       HindIII  CMVV27cpexpress  243                        pGA482G pGA482G           HindIII  CMVV33ce7        198                        pPRBN   pEPG106(ZW)       HindIII  CMVV33ce7        244                        pPRBN   pEPG204(P16sZW)   BglII    CMVV27ce7        196                        pPRBN   pEPG205(P34sZW)   BglII    CMVV27ce7        197                        pGA482G pGA482G           HindIII  17V34cpexp113    190                      ______________________________________                                    

(b) CMV V33

Subsequently, both HindIII and BamHI fragments were excised from clone V33ce7; these fragments carried the complete expression cassette for CMV V33 CP gene. The BamHI fragment (V33 CP expression cassette) was inserted into the BglII site of pEPG204 (PRV16/ZY72/WMBN22) to obtain pEPG196. The BamHI fragment was also inserted into the BglII site of pEPG205 (PRV34/ZY72/WMBN22) to obtain pEPG197 (V3329/PRV34/ZY72/WMBN22). The HindIII fragment harboring the V33 CP cassette was installed into pGA482G to obtain pEPG198 (Table 1).

(c) CMV V34

An NcoI fragment excised from clone TA17V34 was installed into the NcoI site of pUC1318 cpexpress. A resulting plasmid that includes the CMV V34 coding NcoI fragment inserted in the sense orientation is 17V34/cpexp113. A partial HindIII fragment from the plasmid 17V34/cpexp113 was isolated and installed into pGA482G to yield pEPG190 (Table 1).

(d) Agrobacterium Strains

The binary plasmids described here, such as pPRBN (for further information on these plasmids, refer to Applicants' Assignees copending patent application Ser. No. 08/366,991 entitled "Transgenic Plants Expressing DNA Constructs Containing a Plurality of Genes to Impart Virus Resistance" filed on Dec. 30, 1994 (now abandoned), incorporated by reference herein) or their derivatives, can be transferred into Agrobacterium strains A208, C58, LBA4404, C58Z707, A4RS, A4RS(pRi278b), Mog301 and others. Strains A208, C58, LBA4404, and A4RS are available from ATCC, 12301 Parklawn Drive, Rockville, Md. A4RS (pRi278b) was obtained from Dr. F. Casse-Delbart, C.N.R.A., Route de Saint Cyr, F78000, Versailles, France. C58Z707 was obtained from Dr. A. G. Hepburn, University of Illinois, Urbana, Ill. Mog301 was obtained from Mogen NV, Leiden, Netherlands.

D. Transfer of CMV Coat Protein Genes to Tobacco

In order to test whether the CMV CP gene constructs described herein confer protection against CMV challenge with homologous strains, some of the binary plasmids listed above (e.g., pEPG197, pEPG198, pEPG239, and pEPG240) have been used to insert these novel CMV coat protein genes into Nicotiana tobacum. Agrobacterium-mediated transfer of the plant expressible CMV coat protein genes described herein was done using the methods described in PCT published application WO 89/05859, entitled "Agrobacterium Mediated Transformation of Germinating Plant Seeds".

Five R₁ progeny lines of Nicotiana t. transformed with the binary plasmid pEPG239 and five R₁ progeny lines of Nicotiana t. transformed with the binary plasmid pEPG240 have been obtained. These binary plasmids include the coat protein gene of CMV strain V27. The ten R₀ parental plants of these lines were assayed for NPTII protein expression by ELISA. They each expressed NPTII protein by ELISA. Furthermore, these ten lines were assayed for both the NPTII and CMV V27 coat protein genes by PCR analysis. PCR analysis detected both genes in all ten R₀ plants.

The binary plasmid pEPG198 was used to obtain 11 R₀ transgenic Nicotiana t. plants. By PCR analysis, the CMV V33 CP gene was detected in nine of the eleven R₀ plants tested.

Cloning and engineering CMV A35 CP Gene

20-day-old zucchini squash plants in the greenhouse were inoculated with CMV strain A35; after 7-10 days infected leaves were harvested. Total RNA was isolated from these infected plants by the use of Tri-Reagent and the instructions provided with the reagent (Molecular Research Center, inc., Cincinnati, Ohio). Single-stranded cDNA was synthesized using total RNA template. The reaction included 1× first Strand cDNA Synthesis Buffer (GIBCO-BRL), 1 mM dNTP's (Pharmacia), 2 uL oligonucleotide primer RMM352 (150 ug/mL), 2 uL RNasin (Promega), and 1 uL RTase SuperscriptII (GIBCO-BRL) in a 20 uL reaction volume. The CMV A35 coat protein gene was PCR amplified with the use of CMV coat protein-specific primers RMM351 and 352 [SEQ ID NOS:7 and 8]. The PCR included 3 uL of the cDNA synthesis reaction described above, 8 uL of each primer RMM351 and RMM352 (150 ug/uL stock), 5 uL 10× reaction buffer, 4 uL dNTP's (10 mM), 1.5 uL MgCl₂ (50 mM), and 0.5 uL Taq polymerase (BRL-GIBCO). PCR conditions were carried out as follows: 93° 45 sec, 50° 45 sec, then 72° 180 sec for 30 cycles, then 720 for 5 min, then hold at 4°. PCR products were visualized by agarose gel electrophoresis and subsequently cloned.

PCR product molecules were cloned into the pCRII vector supplied with the TA cloning kit (Invitrogen Corp.) Four clones were identified and restriction mapped, however, none were sequenced for further analysis.

Alternatively, an aliquot of the CMV A35 PCR product was digested with NcoI and ligated it into the NcoI site of pUC19B2 cp express to give the plasmid CMV A35cpexp33. The cost protein insert of this plasmid was sequenced with the use of the Sequenase II Kit supplied by USBiochemical (FIG. 8). Sequence analysis reveals that CMV A35 coat protein sequence differs form the coast protein sequences of CMV C, V27, V33, V34, and WL (FIGS. 9 and 10). For example, A35 differs from other CMV C strains at amino acid position #26 (FIG. 9). Examination of the nucleotide sequence comparisons differs from other CMV coat protein genes characterized (FIG. 10).

A BamHI/BIlII fragment was excised from A35cpexp33 and installed into the unique BglII site of pGA482G. The plasmid pUC19B2cpexp provides a BamHI site at the 5' end of the cpexp cassette and a BglII site at the 3' end of the expression cassette. Upon insertion into a BglII site, the unique BglII site of the binary plasmid pGA482 is maintained for subsequent insertions of gene cassettes. Binary plasmids that include the CMV A35 expression cassette are being transformed into various Agrobacterium strains (eg., C58Z707, Mog301, and LBA4404). These Agrobacterium strains are used to transform plants to impart resistance to CMV CARNA5.

All publications, patents and patent documents are incorporated by reference herein, as though individually incorporated by reference. The invention has been described with reference to various specific and preferred embodiments and techniques. However, it should be understood that many variations and modifications may be made while remaining within the spirit and scope of the invention.

    __________________________________________________________________________     #             SEQUENCE LISTING                                                    - -  - - (1) GENERAL INFORMATION:                                              - -    (iii) NUMBER OF SEQUENCES: 15                                           - -  - - (2) INFORMATION FOR SEQ ID NO:1:                                      - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 772 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: cDNA                                               - -    (iii) HYPOTHETICAL: NO                                                  - -     (iv) ANTI-SENSE: NO                                                    - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Cucumber - #Mosaic Virus                                         (C) INDIVIDUAL ISOLATE: - #V-27                                       - -     (ix) FEATURE:                                                                   (A) NAME/KEY: CDS                                                              (B) LOCATION: 3..660                                                  - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                - - CC ATG GAC AAA TCT GAA TCA ACC AGT GCT GGT - # CGT AAC CGT CGG CGT             47                                                                          Met Asp Lys Ser Glu Ser Thr Ser Ala - #Gly Arg Asn Arg Arg Arg                   1             - #  5                - #  10                - #  15         - - CGT CCG CGT CGT GGT TCC CGC TCC GCC TCC TC - #C TCC TCG GAT GCT AAC            95                                                                        Arg Pro Arg Arg Gly Ser Arg Ser Ala Ser Se - #r Ser Ser Asp Ala Asn                             20 - #                 25 - #                 30               - - TTT AGA GTC TTG TCG CAG CAG CTT TCG CGA CT - #T AAC AAG ACG TTA GCA           143                                                                        Phe Arg Val Leu Ser Gln Gln Leu Ser Arg Le - #u Asn Lys Thr Leu Ala                         35     - #             40     - #             45                   - - GCT GGT CGT CCA ACT ATT AAC CAC CCA ACC TT - #T GTA GGG AGT GAA CGC           191                                                                        Ala Gly Arg Pro Thr Ile Asn His Pro Thr Ph - #e Val Gly Ser Glu Arg                     50         - #         55         - #         60                       - - TGT AAA CCT GGG TAC ACG TTC ACA TCT ATT AC - #C CTA AAG CCA CCA AAA           239                                                                        Cys Lys Pro Gly Tyr Thr Phe Thr Ser Ile Th - #r Leu Lys Pro Pro Lys                 65             - #     70             - #     75                           - - ATA GAC CGT GGG TCT TAT TAC GGT AAA AGG TT - #G TTA TTA CCT GAT TCA           287                                                                        Ile Asp Arg Gly Ser Tyr Tyr Gly Lys Arg Le - #u Leu Leu Pro Asp Ser             80                 - # 85                 - # 90                 - # 95        - - GTC ACG GAA TAT GAT AAG AAG CTT GTT TCG CG - #C ATT CAA ATT CGA GTT           335                                                                        Val Thr Glu Tyr Asp Lys Lys Leu Val Ser Ar - #g Ile Gln Ile Arg Val                            100  - #               105  - #               110               - - AAT CCT TTG CCG AAA TTT GAT TCT ACC GTG TG - #G GTA ACA GTC CGT AAA           383                                                                        Asn Pro Leu Pro Lys Phe Asp Ser Thr Val Tr - #p Val Thr Val Arg Lys                        115      - #           120      - #           125                   - - GTT CCT GCC TCC TCG GAC TTA TCC GTT GCC GC - #C ATC TCT GCT ATG TTC           431                                                                        Val Pro Ala Ser Ser Asp Leu Ser Val Ala Al - #a Ile Ser Ala Met Phe                    130          - #       135          - #       140                       - - GCG GAC GGA GCC TCA CCG GTA CTG GTT TAT CA - #G TAT GCT GCA TCT GGA           479                                                                        Ala Asp Gly Ala Ser Pro Val Leu Val Tyr Gl - #n Tyr Ala Ala Ser Gly                145              - #   150              - #   155                           - - GTC CAA GCT AAC AAC AAA TTG TTG TAT GAT CT - #T TCG GCG ATG CGC GCT           527                                                                        Val Gln Ala Asn Asn Lys Leu Leu Tyr Asp Le - #u Ser Ala Met Arg Ala            160                 1 - #65                 1 - #70                 1 -       #75                                                                               - - GAT ATA GGT GAC ATG AGA AAG TAC GCC GTC CT - #C GTG TAT TCA AAA         GAC      575                                                                     Asp Ile Gly Asp Met Arg Lys Tyr Ala Val Le - #u Val Tyr Ser Lys Asp                           180  - #               185  - #               190               - - GAT GCG CTC GAG ACG GAC GAG CTA GTA CTT CA - #T GTT GAC ATC GAG CAC           623                                                                        Asp Ala Leu Glu Thr Asp Glu Leu Val Leu Hi - #s Val Asp Ile Glu His                        195      - #           200      - #           205                   - - CAA CGT ATT CCC ACG TCT GGG ATG CTC CCA GT - #C TGA T TCCGTGTTCC              670                                                                        Gln Arg Ile Pro Thr Ser Gly Met Leu Pro Va - #l  *                                     210          - #       215                                              - - CAGAACCCTC CCTCCGATTT CTGTGGCGGG AGCTGAGTTG GCAGTTCTGC TA -              #TAAACTGT    730                                                                  - - CTGAAGTCAC TAAACGTTTC ACGGTGAACG GGTTGTCCAT GG    - #                       - # 772                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:2:                                      - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 218 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                - - Met Asp Lys Ser Glu Ser Thr Ser Ala Gly Ar - #g Asn Arg Arg Arg Arg         1               5 - #                 10 - #                 15               - - Pro Arg Arg Gly Ser Arg Ser Ala Ser Ser Se - #r Ser Asp Ala Asn Phe                    20     - #             25     - #             30                   - - Arg Val Leu Ser Gln Gln Leu Ser Arg Leu As - #n Lys Thr Leu Ala Ala                35         - #         40         - #         45                       - - Gly Arg Pro Thr Ile Asn His Pro Thr Phe Va - #l Gly Ser Glu Arg Cys            50             - #     55             - #     60                           - - Lys Pro Gly Tyr Thr Phe Thr Ser Ile Thr Le - #u Lys Pro Pro Lys Ile        65                 - # 70                 - # 75                 - # 80        - - Asp Arg Gly Ser Tyr Tyr Gly Lys Arg Leu Le - #u Leu Pro Asp Ser Val                        85 - #                 90 - #                 95               - - Thr Glu Tyr Asp Lys Lys Leu Val Ser Arg Il - #e Gln Ile Arg Val Asn                   100      - #           105      - #           110                   - - Pro Leu Pro Lys Phe Asp Ser Thr Val Trp Va - #l Thr Val Arg Lys Val               115          - #       120          - #       125                       - - Pro Ala Ser Ser Asp Leu Ser Val Ala Ala Il - #e Ser Ala Met Phe Ala           130              - #   135              - #   140                           - - Asp Gly Ala Ser Pro Val Leu Val Tyr Gln Ty - #r Ala Ala Ser Gly Val       145                 1 - #50                 1 - #55                 1 -       #60                                                                               - - Gln Ala Asn Asn Lys Leu Leu Tyr Asp Leu Se - #r Ala Met Arg Ala         Asp                                                                                              165  - #               170  - #               175              - - Ile Gly Asp Met Arg Lys Tyr Ala Val Leu Va - #l Tyr Ser Lys Asp Asp                   180      - #           185      - #           190                   - - Ala Leu Glu Thr Asp Glu Leu Val Leu His Va - #l Asp Ile Glu His Gln               195          - #       200          - #       205                       - - Arg Ile Pro Thr Ser Gly Met Leu Pro Val                                       210              - #   215                                                  - -  - - (2) INFORMATION FOR SEQ ID NO:3:                                      - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 792 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: cDNA                                               - -    (iii) HYPOTHETICAL: NO                                                  - -     (iv) ANTI-SENSE: NO                                                    - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: CUCUMBER - #MOSAIC VIRUS                                         (B) STRAIN: v-33                                                      - -     (ix) FEATURE:                                                                   (A) NAME/KEY: CDS                                                              (B) LOCATION: 3..660                                                  - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                - - CC ATG GAC AAA TCT GAA TCA ACC AGT GCT GGT - # CGT AAC CGT CGA CGT             47                                                                           Met Asp Lys Ser Glu Ser Thr Ser Ala - #Gly Arg Asn Arg Arg Arg                 220               - #  225               - #  230                            - - CGT CCG CGT CGT GGT TCC CGC TCC GCC CCC TC - #C TCC GCG GAT GCC AAC            95                                                                        Arg Pro Arg Arg Gly Ser Arg Ser Ala Pro Se - #r Ser Ala Asp Ala Asn            235                 2 - #40                 2 - #45                 2 -       #50                                                                               - - TTT AGA GTC TTG TCG CAG CAG CTT TCG CGA CT - #T AAT AAG ACG TTG         TCA      143                                                                     Phe Arg Val Leu Ser Gln Gln Leu Ser Arg Le - #u Asn Lys Thr Leu Ser                           255  - #               260  - #               265               - - GCT GGT CGT CCA ACT ATT AAC CAC CCA ACC TT - #T GTA GGG AGT GAG CGT           191                                                                        Ala Gly Arg Pro Thr Ile Asn His Pro Thr Ph - #e Val Gly Ser Glu Arg                        270      - #           275      - #           280                   - - TGT AAA TCT GGG TAC ACG TTC ACA TCT ATT AC - #C CTA AAG CCG CCG AAA           239                                                                        Cys Lys Ser Gly Tyr Thr Phe Thr Ser Ile Th - #r Leu Lys Pro Pro Lys                    285          - #       290          - #       295                       - - ATA GAC CGT GGG TCT TAT TAT GGT AAA AGG TT - #G TTA TTA CCT GAT TCA           287                                                                        Ile Asp Arg Gly Ser Tyr Tyr Gly Lys Arg Le - #u Leu Leu Pro Asp Ser                300              - #   305              - #   310                           - - GTC ACA GAA TAT GAT AAG AAA CTT GTT TCG CG - #C ATT CAA ATT CGA GTT           335                                                                        Val Thr Glu Tyr Asp Lys Lys Leu Val Ser Ar - #g Ile Gln Ile Arg Val            315                 3 - #20                 3 - #25                 3 -       #30                                                                               - - AAT CCC TTG CCG AAA TTT GAT TCT ACC GTG TG - #G GTG ACA GTC CGT         AAA      383                                                                     Asn Pro Leu Pro Lys Phe Asp Ser Thr Val Tr - #p Val Thr Val Arg Lys                           335  - #               340  - #               345               - - GTT CCT GCC TCC TCG GAC TTA TCC GTT GCC GC - #C ATC TCT GCT ATG TTT           431                                                                        Val Pro Ala Ser Ser Asp Leu Ser Val Ala Al - #a Ile Ser Ala Met Phe                        350      - #           355      - #           360                   - - GCG GAC GGA GCC TCA CCG GTA CTG GTT TAT CA - #G TAC GCT GCA TCT GGA           479                                                                        Ala Asp Gly Ala Ser Pro Val Leu Val Tyr Gl - #n Tyr Ala Ala Ser Gly                    365          - #       370          - #       375                       - - GTC CAA GCT AAC AAC AAA TTG TTG TAT GAT CT - #T TCG GCG ATG CGC GCT           527                                                                        Val Gln Ala Asn Asn Lys Leu Leu Tyr Asp Le - #u Ser Ala Met Arg Ala                380              - #   385              - #   390                           - - GAT ATA GGC GAC ATG AGA AAG TAC GCC GTC CT - #C GTG TAT TCA AAA GAC           575                                                                        Asp Ile Gly Asp Met Arg Lys Tyr Ala Val Le - #u Val Tyr Ser Lys Asp            395                 4 - #00                 4 - #05                 4 -       #10                                                                               - - GAT GCA CTC GAG ACG GAC GAG CTA GTA CTT CA - #T GTT GAC GTC GAG         CAC      623                                                                     Asp Ala Leu Glu Thr Asp Glu Leu Val Leu Hi - #s Val Asp Val Glu His                           415  - #               420  - #               425               - - CAA CGC ATT CCC ACG TCT GGG GTG CTC CCA GT - #A TAA T TCTGTGCTTT              670                                                                        Gln Arg Ile Pro Thr Ser Gly Val Leu Pro Va - #l  *                                         430      - #           435                                          - - CCAGAACCCT CCCTCCGATT TCTGTGGCGG GAGCTGAGTT GGCAGTTCTG CT -              #GTAAACTG    730                                                                  - - TCTGAAGTCA CTAAACGTTT TACGGTGAAC GGGTTGTCCA TGGGTTTCGG TT -             #TTTTTGTT    790                                                                  - - AA                  - #                  - #                  - #                  792                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:4:                                      - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 218 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                - - Met Asp Lys Ser Glu Ser Thr Ser Ala Gly Ar - #g Asn Arg Arg Arg Arg         1               5 - #                 10 - #                 15               - - Pro Arg Arg Gly Ser Arg Ser Ala Pro Ser Se - #r Ala Asp Ala Asn Phe                    20     - #             25     - #             30                   - - Arg Val Leu Ser Gln Gln Leu Ser Arg Leu As - #n Lys Thr Leu Ser Ala                35         - #         40         - #         45                       - - Gly Arg Pro Thr Ile Asn His Pro Thr Phe Va - #l Gly Ser Glu Arg Cys            50             - #     55             - #     60                           - - Lys Ser Gly Tyr Thr Phe Thr Ser Ile Thr Le - #u Lys Pro Pro Lys Ile        65                 - # 70                 - # 75                 - # 80        - - Asp Arg Gly Ser Tyr Tyr Gly Lys Arg Leu Le - #u Leu Pro Asp Ser Val                        85 - #                 90 - #                 95               - - Thr Glu Tyr Asp Lys Lys Leu Val Ser Arg Il - #e Gln Ile Arg Val Asn                   100      - #           105      - #           110                   - - Pro Leu Pro Lys Phe Asp Ser Thr Val Trp Va - #l Thr Val Arg Lys Val               115          - #       120          - #       125                       - - Pro Ala Ser Ser Asp Leu Ser Val Ala Ala Il - #e Ser Ala Met Phe Ala           130              - #   135              - #   140                           - - Asp Gly Ala Ser Pro Val Leu Val Tyr Gln Ty - #r Ala Ala Ser Gly Val       145                 1 - #50                 1 - #55                 1 -       #60                                                                               - - Gln Ala Asn Asn Lys Leu Leu Tyr Asp Leu Se - #r Ala Met Arg Ala         Asp                                                                                              165  - #               170  - #               175              - - Ile Gly Asp Met Arg Lys Tyr Ala Val Leu Va - #l Tyr Ser Lys Asp Asp                   180      - #           185      - #           190                   - - Ala Leu Glu Thr Asp Glu Leu Val Leu His Va - #l Asp Val Glu His Gln               195          - #       200          - #       205                       - - Arg Ile Pro Thr Ser Gly Val Leu Pro Val                                       210              - #   215                                                  - -  - - (2) INFORMATION FOR SEQ ID NO:5:                                      - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 771 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: cDNA                                               - -    (iii) HYPOTHETICAL: NO                                                  - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Cucumber - #mosaic virus                                         (B) STRAIN: V-34                                                      - -     (ix) FEATURE:                                                                   (A) NAME/KEY: CDS                                                              (B) LOCATION: 3..660                                                           (D) OTHER INFORMATION: - #/codon.sub.-- start= 3                                    /function=- # "ENCAPSIDATES VIRUS RNA"                                         /product=- # "COAT PROTEIN"                                                    /gene= - #"CP"                                                                 /number= - #1                                                                  /standard.sub.-- - #name= "COAT PROTEIN"                         - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                - - CC ATG GAC AAA TCT GAA TCA ACC AGT GCT GGT - # CGT AAC CGT CGA CGT             47                                                                           Met Asp Lys Ser Glu Ser Thr Ser Ala - #Gly Arg Asn Arg Arg Arg                 220               - #  225               - #  230                            - - CGT CCG CGT CGT GGT TCC CGC TCC GCT TCC TC - #C TCT TCG GAT GCT AAC            95                                                                        Arg Pro Arg Arg Gly Ser Arg Ser Ala Ser Se - #r Ser Ser Asp Ala Asn            235                 2 - #40                 2 - #45                 2 -       #50                                                                               - - TTT AGA GTC TTG TCG CAG CAG CTT TCG CGA CT - #T AAC AAG ACG TTA         GCA      143                                                                     Phe Arg Val Leu Ser Gln Gln Leu Ser Arg Le - #u Asn Lys Thr Leu Ala                           255  - #               260  - #               265               - - GCT GGT CGT CCA ACT ATT AAC CAC CCA ACC TT - #T GTA GGG AGT GAA CGC           191                                                                        Ala Gly Arg Pro Thr Ile Asn His Pro Thr Ph - #e Val Gly Ser Glu Arg                        270      - #           275      - #           280                   - - TGT AGA CCT GGG TAC ACG TTC ACA TCT ATT AC - #C CTA AAG CCA CCA AAA           239                                                                        Cys Arg Pro Gly Tyr Thr Phe Thr Ser Ile Th - #r Leu Lys Pro Pro Lys                    285          - #       290          - #       295                       - - ATA GAC CGC GGG TCT TAC TAC GGT AAA AGG TT - #G TTA CTA CCT GAT TCA           287                                                                        Ile Asp Arg Gly Ser Tyr Tyr Gly Lys Arg Le - #u Leu Leu Pro Asp Ser                300              - #   305              - #   310                           - - GTC ACG GAA TAT GAT AAG AAG CTT GTT TCG CG - #C ATT CAA ATT CGA GTT           335                                                                        Val Thr Glu Tyr Asp Lys Lys Leu Val Ser Ar - #g Ile Gln Ile Arg Val            315                 3 - #20                 3 - #25                 3 -       #30                                                                               - - AAT CCT TTG CCG AAA TTT GAT TCT ACC GTG TG - #G GTG ACA GTT CGT         AAA      383                                                                     Asn Pro Leu Pro Lys Phe Asp Ser Thr Val Tr - #p Val Thr Val Arg Lys                           335  - #               340  - #               345               - - GTT CCT GCC TCC TCG GAC TTA TCC GTT GCC GC - #C ATC TCT GCT ATG TTC           431                                                                        Val Pro Ala Ser Ser Asp Leu Ser Val Ala Al - #a Ile Ser Ala Met Phe                        350      - #           355      - #           360                   - - GCG GAC GGA GCC TCA CCG GTA CTG GTT TAT CA - #G TAT GCT GCA TCT GGA           479                                                                        Ala Asp Gly Ala Ser Pro Val Leu Val Tyr Gl - #n Tyr Ala Ala Ser Gly                    365          - #       370          - #       375                       - - GTT CAA GCT AAC AAC AAA TTG TTG TAT GAT CT - #T TCG GCG ATG CGC GCT           527                                                                        Val Gln Ala Asn Asn Lys Leu Leu Tyr Asp Le - #u Ser Ala Met Arg Ala                380              - #   385              - #   390                           - - GAT ATA GGT GAC ATG AGA AAG TAC GCC GTC CT - #C GTG TAT TCA AAA GAC           575                                                                        Asp Ile Gly Asp Met Arg Lys Tyr Ala Val Le - #u Val Tyr Ser Lys Asp            395                 4 - #00                 4 - #05                 4 -       #10                                                                               - - GAT GCA CTC GAG ACG GAC GAG CTA GTA CTT CA - #T GTT GAC ATC GAG         CAC      623                                                                     Asp Ala Leu Glu Thr Asp Glu Leu Val Leu Hi - #s Val Asp Ile Glu His                           415  - #               420  - #               425               - - CAA CGC ATT CCC ACG TCT GGG GTG CTC CCA GT - #T TGA T TCCGTGTTCC              670                                                                        Gln Arg Ile Pro Thr Ser Gly Val Leu Pro Va - #l  *                                         430      - #           435                                          - - AGAACCCTCC CTCCGATTTC TGTGGCGGGA GCTGAGTTGG CAGTTCTGCT AT -              #AAACTGTC    730                                                                  - - TGAAGTCACT AAACGTTTTA CGGTGAACGG GTTGTCCATG G    - #                       - #  771                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:6:                                      - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 218 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                - - Met Asp Lys Ser Glu Ser Thr Ser Ala Gly Ar - #g Asn Arg Arg Arg Arg         1               5 - #                 10 - #                 15               - - Pro Arg Arg Gly Ser Arg Ser Ala Ser Ser Se - #r Ser Asp Ala Asn Phe                    20     - #             25     - #             30                   - - Arg Val Leu Ser Gln Gln Leu Ser Arg Leu As - #n Lys Thr Leu Ala Ala                35         - #         40         - #         45                       - - Gly Arg Pro Thr Ile Asn His Pro Thr Phe Va - #l Gly Ser Glu Arg Cys            50             - #     55             - #     60                           - - Arg Pro Gly Tyr Thr Phe Thr Ser Ile Thr Le - #u Lys Pro Pro Lys Ile        65                 - # 70                 - # 75                 - # 80        - - Asp Arg Gly Ser Tyr Tyr Gly Lys Arg Leu Le - #u Leu Pro Asp Ser Val                        85 - #                 90 - #                 95               - - Thr Glu Tyr Asp Lys Lys Leu Val Ser Arg Il - #e Gln Ile Arg Val Asn                   100      - #           105      - #           110                   - - Pro Leu Pro Lys Phe Asp Ser Thr Val Trp Va - #l Thr Val Arg Lys Val               115          - #       120          - #       125                       - - Pro Ala Ser Ser Asp Leu Ser Val Ala Ala Il - #e Ser Ala Met Phe Ala           130              - #   135              - #   140                           - - Asp Gly Ala Ser Pro Val Leu Val Tyr Gln Ty - #r Ala Ala Ser Gly Val       145                 1 - #50                 1 - #55                 1 -       #60                                                                               - - Gln Ala Asn Asn Lys Leu Leu Tyr Asp Leu Se - #r Ala Met Arg Ala         Asp                                                                                              165  - #               170  - #               175              - - Ile Gly Asp Met Arg Lys Tyr Ala Val Leu Va - #l Tyr Ser Lys Asp Asp                   180      - #           185      - #           190                   - - Ala Leu Glu Thr Asp Glu Leu Val Leu His Va - #l Asp Ile Glu His Gln               195          - #       200          - #       205                       - - Arg Ile Pro Thr Ser Gly Val Leu Pro Val                                       210              - #   215                                                  - -  - - (2) INFORMATION FOR SEQ ID NO:7:                                      - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 25 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: other nucleic acid                                          (A) DESCRIPTION: /desc - #= "Oligonucleotide Primer RMM                             351"                                                             - -    (iii) HYPOTHETICAL: NO                                                  - -     (iv) ANTI-SENSE: NO                                                    - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                - - CGTAGAATTC AGTCGAGCCA TGGAC          - #                  - #                    25                                                                       - -  - - (2) INFORMATION FOR SEQ ID NO:8:                                      - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 28 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: other nucleic acid                                          (A) DESCRIPTION: /desc - #= "Oligonucleotide Primer                                 RMM352"                                                          - -    (iii) HYPOTHETICAL: NO                                                  - -     (iv) ANTI-SENSE: NO                                                    - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                - - GACCACTCGA GCCGTAAGCT CCATGGAC         - #                  - #                  28                                                                       - -  - - (2) INFORMATION FOR SEQ ID NO:9:                                      - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 960 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: cDNA                                               - -    (iii) HYPOTHETICAL: NO                                                  - -     (iv) ANTI-SENSE: NO                                                    - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: CUCUMBER - #MOSAIC VIRUS                                         (B) STRAIN: STRAIN C                                                  - -     (ix) FEATURE:                                                                   (A) NAME/KEY: CDS                                                              (B) LOCATION: 1..658                                                  - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                - - ATG GAC AAA TCT GAA TCA ACC AGT GCT GGT CG - #T AAC CAT CGA CGT CGT        48                                                                            Met Asp Lys Ser Glu Ser Thr Ser Ala Gly Ar - #g Asn His Arg Arg Arg            220                 2 - #25                 2 - #30                 2 -       #35                                                                               - - CCG CGT CGT GGT TCC CGC TCC GCC CCC TCC TC - #C GCG GAT GCT AAC         TTT   96                                                                         Pro Arg Arg Gly Ser Arg Ser Ala Pro Ser Se - #r Ala Asp Ala Asn Phe                           240  - #               245  - #               250               - - AGA GTC TTG TCG CAG CAG CTT TCG CGA CTT AA - #T AAG ACG TTA GCA GCT       144                                                                            Arg Val Leu Ser Gln Gln Leu Ser Arg Leu As - #n Lys Thr Leu Ala Ala                        255      - #           260      - #           265                   - - GGT CGT CCA ACT ATT AAC CAC CCA ACC TTT GT - #A GGG AGT GAA CGC TGT       192                                                                            Gly Arg Pro Thr Ile Asn His Pro Thr Phe Va - #l Gly Ser Glu Arg Cys                    270          - #       275          - #       280                       - - AGA CCT GGG TAC ACG TTC ACA TCT ATT ACC CT - #A AAG CCA CCA AAA ATA       240                                                                            Arg Pro Gly Tyr Thr Phe Thr Ser Ile Thr Le - #u Lys Pro Pro Lys Ile                285              - #   290              - #   295                           - - GAC CGT GAG TCT TAT TAC GGT AAA AGG TTG TT - #A CTA CCT GAT TCA GTC       288                                                                            Asp Arg Glu Ser Tyr Tyr Gly Lys Arg Leu Le - #u Leu Pro Asp Ser Val            300                 3 - #05                 3 - #10                 3 -       #15                                                                               - - ACG GAA TAT GAT AAG AAG CTT GTT TCG CGC AT - #T CAA ATT CGA GTT         AAT  336                                                                         Thr Glu Tyr Asp Lys Lys Leu Val Ser Arg Il - #e Gln Ile Arg Val Asn                           320  - #               325  - #               330               - - CCT TTG CCG AAA TTT GAT TCT ACC GTG TGG GT - #G ACA GTC CGT AAA GTT       384                                                                            Pro Leu Pro Lys Phe Asp Ser Thr Val Trp Va - #l Thr Val Arg Lys Val                        335      - #           340      - #           345                   - - CCT GCC TCC TCG GAC TTA TCC GTT GCC GCC AT - #C TCT GCT ATG TTC GCG       432                                                                            Pro Ala Ser Ser Asp Leu Ser Val Ala Ala Il - #e Ser Ala Met Phe Ala                    350          - #       355          - #       360                       - - GAC GGA GCC TCA CCG GTA CTG GTT TAT CAG TA - #T GCC GCA TCT GGA GTC       480                                                                            Asp Gly Ala Ser Pro Val Leu Val Tyr Gln Ty - #r Ala Ala Ser Gly Val                365              - #   370              - #   375                           - - CAA GCC AAC AAC AAA CTG TTG TTT GAT CTT TC - #G GCG ATG CGC GCT GAT       528                                                                            Gln Ala Asn Asn Lys Leu Leu Phe Asp Leu Se - #r Ala Met Arg Ala Asp            380                 3 - #85                 3 - #90                 3 -       #95                                                                               - - ATA GGT GAC ATG AGA AAG TAC GCC GTC CTC GT - #G TAT TCA AAA GAC         GAT  576                                                                         Ile Gly Asp Met Arg Lys Tyr Ala Val Leu Va - #l Tyr Ser Lys Asp Asp                           400  - #               405  - #               410               - - GCG CTC GAG ACG GAC GAG CTA GTA CTT CAT GT - #T GAC ATC GAG CAC CAA       624                                                                            Ala Leu Glu Thr Asp Glu Leu Val Leu His Va - #l Asp Ile Glu His Gln                        415      - #           420      - #           425                   - - CGC ATT CCC ACA TCT GGA GTG CTC CCA GTC TG - #A T TCCGTGTTCC              668                                                                            Arg Ile Pro Thr Ser Gly Val Leu Pro Val  - #*                                          430          - #       435                                              - - CAGAACCCTC CCTCCGATCT CTGTGGCGGG AGCTGAGTTG GCAGTTCTAC TA -              #CAAACTGT    728                                                                  - - CTGGAGTCAC TAAACGTTTT ACGGTGAACG GGTTGTCCAT CCAGCTTACG GC -             #TAAAATGG    788                                                                  - - TCAGTCGTGG AGAAATCCAC GCCAGCAGAT TTACAAATCT CTGAGGCGCC TT -             #TGAAACCA    848                                                                  - - TCTCCTAGGT TTCTTCGGAA GGGCTTCGGT CCGTGTACCT CTAGCGCAAC GT -             #GCTAGTTT    908                                                                  - - CAGGGTACGG GTGCCCCCCC ACTTTCGTGG GGGCCTCCAA AAGGAGACCA AA - #                 960                                                                        - -  - - (2) INFORMATION FOR SEQ ID NO:10:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 218 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                               - - Met Asp Lys Ser Glu Ser Thr Ser Ala Gly Ar - #g Asn His Arg Arg Arg         1               5 - #                 10 - #                 15               - - Pro Arg Arg Gly Ser Arg Ser Ala Pro Ser Se - #r Ala Asp Ala Asn Phe                    20     - #             25     - #             30                   - - Arg Val Leu Ser Gln Gln Leu Ser Arg Leu As - #n Lys Thr Leu Ala Ala                35         - #         40         - #         45                       - - Gly Arg Pro Thr Ile Asn His Pro Thr Phe Va - #l Gly Ser Glu Arg Cys            50             - #     55             - #     60                           - - Arg Pro Gly Tyr Thr Phe Thr Ser Ile Thr Le - #u Lys Pro Pro Lys Ile        65                 - # 70                 - # 75                 - # 80        - - Asp Arg Glu Ser Tyr Tyr Gly Lys Arg Leu Le - #u Leu Pro Asp Ser Val                        85 - #                 90 - #                 95               - - Thr Glu Tyr Asp Lys Lys Leu Val Ser Arg Il - #e Gln Ile Arg Val Asn                   100      - #           105      - #           110                   - - Pro Leu Pro Lys Phe Asp Ser Thr Val Trp Va - #l Thr Val Arg Lys Val               115          - #       120          - #       125                       - - Pro Ala Ser Ser Asp Leu Ser Val Ala Ala Il - #e Ser Ala Met Phe Ala           130              - #   135              - #   140                           - - Asp Gly Ala Ser Pro Val Leu Val Tyr Gln Ty - #r Ala Ala Ser Gly Val       145                 1 - #50                 1 - #55                 1 -       #60                                                                               - - Gln Ala Asn Asn Lys Leu Leu Phe Asp Leu Se - #r Ala Met Arg Ala         Asp                                                                                              165  - #               170  - #               175              - - Ile Gly Asp Met Arg Lys Tyr Ala Val Leu Va - #l Tyr Ser Lys Asp Asp                   180      - #           185      - #           190                   - - Ala Leu Glu Thr Asp Glu Leu Val Leu His Va - #l Asp Ile Glu His Gln               195          - #       200          - #       205                       - - Arg Ile Pro Thr Ser Gly Val Leu Pro Val                                       210              - #   215                                                  - -  - - (2) INFORMATION FOR SEQ ID NO:11:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 983 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: cDNA                                               - -    (iii) HYPOTHETICAL: NO                                                  - -     (iv) ANTI-SENSE: NO                                                    - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: CUCUMBER - #MOSAIC VIRUS                                         (B) STRAIN: WHITE LEAF                                                - -     (ix) FEATURE:                                                                   (A) NAME/KEY: CDS                                                              (B) LOCATION: 1..657                                                  - -      (x) PUBLICATION INFORMATION:                                                   (A) AUTHORS: Quemada, H                                                             Kearney, - #C                                                                  Gonsalves, - #D                                                                Slightom, - #J                                                            (B) TITLE: Nucleotide S - #equences of the Coat Protein                             Genes and - # Flanking Regions of Cucumber Mosaic                              Virus Str - #ains C and WL RNA 3                                          (C) JOURNAL: J. Gen. - #Virol.                                                 (D) VOLUME: 70                                                                 (F) PAGES: 1065-1073                                                           (G) DATE: 1989                                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:                               - - ATG GAC AAA TCT GGA TCT CCC AAT GCT AGT AG - #A ACC TCC CGG CGT CGT            48                                                                        Met Asp Lys Ser Gly Ser Pro Asn Ala Ser Ar - #g Thr Ser Arg Arg Arg            220                 2 - #25                 2 - #30                 2 -       #35                                                                               - - CGC CCG CGT AGA GGT TCT CGG TCC GCT TCT GG - #T GCG GAT GCA GGG         TTG       96                                                                     Arg Pro Arg Arg Gly Ser Arg Ser Ala Ser Gl - #y Ala Asp Ala Gly Leu                           240  - #               245  - #               250               - - CGT GCT TTG ACT CAG CAG ATG CTG AAA CTC AA - #T AGA ACC CTC GCC ATT           144                                                                        Arg Ala Leu Thr Gln Gln Met Leu Lys Leu As - #n Arg Thr Leu Ala Ile                        255      - #           260      - #           265                   - - GGT CGT CCC ACT CTT AAC CAC CCA ACC TTC GT - #G GGT AGT GAA AGC TGT           192                                                                        Gly Arg Pro Thr Leu Asn His Pro Thr Phe Va - #l Gly Ser Glu Ser Cys                    270          - #       275          - #       280                       - - AAA CCC GGT TAC ACT TTC ACA TCT ATT ACC CT - #G AAA CCG CCT GAA ATT           240                                                                        Lys Pro Gly Tyr Thr Phe Thr Ser Ile Thr Le - #u Lys Pro Pro Glu Ile                285              - #   290              - #   295                           - - GAG AAA GGT TCA TAT TTT GGT AGA AGG TTG TC - #T TTG CCA GAT TCA GTC           288                                                                        Glu Lys Gly Ser Tyr Phe Gly Arg Arg Leu Se - #r Leu Pro Asp Ser Val            300                 3 - #05                 3 - #10                 3 -       #15                                                                               - - ACG GAC TAT GAT AAG AAG CTT GTT TCG CGC AT - #T CAA ATC AGG GTT         AAT      336                                                                     Thr Asp Tyr Asp Lys Lys Leu Val Ser Arg Il - #e Gln Ile Arg Val Asn                           320  - #               325  - #               330               - - CCT TTG CCG AAA TTT GAT TCT ACC GTG TGG GT - #T ACA GTT CGG AAA GTA           384                                                                        Pro Leu Pro Lys Phe Asp Ser Thr Val Trp Va - #l Thr Val Arg Lys Val                        335      - #           340      - #           345                   - - CCT TCA TCA TCC GAT CTT TCC GTC GCC GCC AT - #C TCT GCT ATG TTT GGC           432                                                                        Pro Ser Ser Ser Asp Leu Ser Val Ala Ala Il - #e Ser Ala Met Phe Gly                    350          - #       355          - #       360                       - - GAT GGT AAT TCA CCG GTT TTG GTT TAT CAG TA - #T GCT GCG TCC GGA GTT           480                                                                        Asp Gly Asn Ser Pro Val Leu Val Tyr Gln Ty - #r Ala Ala Ser Gly Val                365              - #   370              - #   375                           - - CAG GCC AAC AAT AAG TTA CTT TAT GAC CTG TC - #C GAG ATG CGT GCT GAT           528                                                                        Gln Ala Asn Asn Lys Leu Leu Tyr Asp Leu Se - #r Glu Met Arg Ala Asp            380                 3 - #85                 3 - #90                 3 -       #95                                                                               - - ATC GGC GAC ATG CGT AAG TAC GCC GTC CTG GT - #T TAC TCG AAA GAC         GAT      576                                                                     Ile Gly Asp Met Arg Lys Tyr Ala Val Leu Va - #l Tyr Ser Lys Asp Asp                           400  - #               405  - #               410               - - AAA CTA GAG AAG GAC GAG ATT GCA CTT CAT GT - #C GAC GTC GAG CAT CAA           624                                                                        Lys Leu Glu Lys Asp Glu Ile Ala Leu His Va - #l Asp Val Glu His Gln                        415      - #           420      - #           425                   - - CGA ATT CCT ATC TCA CGG ATG CTC CCG ACT TA - #G TCCGTGTGTT TACCGGCGT     C    677                                                                        Arg Ile Pro Ile Ser Arg Met Leu Pro Thr  - #*                                          430          - #       435                                              - - CGAGAACGTT AAACTACACT CTCAATCGCG AGTGCTGACT TGGTAGTATT GC -              #TTCAAACT    737                                                                  - - GCCTGAAGTC CCTAAACGTG TTGTTGCGCG GGGAACGGGT GTCCATCCAG CT -             #TACGGCTA    797                                                                  - - AAATGGTCGT GTCTTTCACA CGCCGATGTC TTACAAGATG TCGAGATACC CT -             #TGAAATCA    857                                                                  - - TCTCCTAGAT TTCTTCGGAA GGGCTTCGTG AGAAGCTCGT GCACGGTAAT AC -             #ACTTGATA    917                                                                  - - TTACCAAGAG TGCGGGTATC GCCTGTGGTT TTCCACAGGT TCTCCAGGTT CT -             #CCATAAGG    977                                                                  - - AGACCA                 - #                  - #                  -      #          983                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:12:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 218 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:                               - - Met Asp Lys Ser Gly Ser Pro Asn Ala Ser Ar - #g Thr Ser Arg Arg Arg         1               5 - #                 10 - #                 15               - - Arg Pro Arg Arg Gly Ser Arg Ser Ala Ser Gl - #y Ala Asp Ala Gly Leu                    20     - #             25     - #             30                   - - Arg Ala Leu Thr Gln Gln Met Leu Lys Leu As - #n Arg Thr Leu Ala Ile                35         - #         40         - #         45                       - - Gly Arg Pro Thr Leu Asn His Pro Thr Phe Va - #l Gly Ser Glu Ser Cys            50             - #     55             - #     60                           - - Lys Pro Gly Tyr Thr Phe Thr Ser Ile Thr Le - #u Lys Pro Pro Glu Ile        65                 - # 70                 - # 75                 - # 80        - - Glu Lys Gly Ser Tyr Phe Gly Arg Arg Leu Se - #r Leu Pro Asp Ser Val                        85 - #                 90 - #                 95               - - Thr Asp Tyr Asp Lys Lys Leu Val Ser Arg Il - #e Gln Ile Arg Val Asn                   100      - #           105      - #           110                   - - Pro Leu Pro Lys Phe Asp Ser Thr Val Trp Va - #l Thr Val Arg Lys Val               115          - #       120          - #       125                       - - Pro Ser Ser Ser Asp Leu Ser Val Ala Ala Il - #e Ser Ala Met Phe Gly           130              - #   135              - #   140                           - - Asp Gly Asn Ser Pro Val Leu Val Tyr Gln Ty - #r Ala Ala Ser Gly Val       145                 1 - #50                 1 - #55                 1 -       #60                                                                               - - Gln Ala Asn Asn Lys Leu Leu Tyr Asp Leu Se - #r Glu Met Arg Ala         Asp                                                                                              165  - #               170  - #               175              - - Ile Gly Asp Met Arg Lys Tyr Ala Val Leu Va - #l Tyr Ser Lys Asp Asp                   180      - #           185      - #           190                   - - Lys Leu Glu Lys Asp Glu Ile Ala Leu His Va - #l Asp Val Glu His Gln               195          - #       200          - #       205                       - - Arg Ile Pro Ile Ser Arg Met Leu Pro Thr                                       210              - #   215                                                  - -  - - (2) INFORMATION FOR SEQ ID NO:13:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 218 amino - #acids                                                 (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: cDNA                                               - -    (iii) HYPOTHETICAL: NO                                                  - -     (iv) ANTI-SENSE: NO                                                    - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: CUCUMBER - #MOSAIC VIRUS                                         (B) STRAIN: Q3                                                        - -      (x) PUBLICATION INFORMATION:                                                   (A) AUTHORS: Gould, AR                                                              Symons, R - #H                                                            (B) TITLE: Cucumber Mos - #aic Virus RNA 3:  Determination                          of the - #nucleotide sequence provides the amino acid                          sequences - #of protein 3a and viral coat protein                         (C) JOURNAL: Eur. J. - #Biochem                                                (D) VOLUME: 126                                                                (F) PAGES: 217-226                                                             (G) DATE: 1982                                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:                               - - Met Asp Lys Ser Gly Ser Pro Asn Ala Ser Ar - #g Thr Ser Arg Arg Arg       1               5   - #                10  - #                15                - - Arg Pro Arg Arg Gly Ser Arg Ser Ala Ser Gl - #y Ala Asp Ala Gly Leu                   20      - #            25      - #            30                    - - Arg Ala Leu Thr Gln Gln Met Leu Arg Leu As - #n Lys Thr Leu Ala Ile               35          - #        40          - #        45                        - - Gly Arg Pro Thr Leu Asn His Pro Thr Phe Va - #l Gly Ser Glu Ser Cys           50              - #    55              - #    60                            - - Lys Pro Gly Tyr Thr Phe Thr Ser Ile Thr Le - #u Lys Pro Pro Glu Ile       65                  - #70                  - #75                  - #80         - - Glu Lys Gly Ser Tyr Phe Gly Arg Arg Leu Se - #r Leu Pro Asp Ser Val                       85  - #                90  - #                95                - - Thr Asp Tyr Asp Lys Lys Leu Val Ser Arg Il - #e Gln Ile Arg Ile Asn                   100      - #           105      - #           110                   - - Pro Leu Pro Lys Phe Asp Ser Thr Val Trp Va - #l Thr Val Arg Lys Val               115          - #       120          - #       125                       - - Pro Ser Ser Ser Asp Leu Ser Val Ala Ala Il - #e Ser Ala Met Phe Gly           130              - #   135              - #   140                           - - Asp Gly Asn Ser Pro Val Leu Val Tyr Gln Ty - #r Ala Ala Ser Gly Val       145                 1 - #50                 1 - #55                 1 -       #60                                                                               - - Gln Ala Asn Asn Lys Leu Leu Tyr Asp Leu Se - #r Glu Met Arg Ala         Asp                                                                                              165  - #               170  - #               175              - - Ile Gly Asp Met Arg Lys Tyr Ala Val Leu Va - #l Tyr Ser Lys Asp Asp                   180      - #           185      - #           190                   - - Lys Leu Glu Lys Asp Glu Ile Val Leu His Va - #l Asp Val Glu His Gln               195          - #       200          - #       205                       - - Arg Ile Pro Ile Ser Arg Met Leu Pro Thr                                       210              - #   215                                                  - -  - - (2) INFORMATION FOR SEQ ID NO:14:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 772 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: cDNA                                               - -    (iii) HYPOTHETICAL: NO                                                  - -     (iv) ANTI-SENSE: NO                                                    - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Cucumber - #Mosaic Virus                                         (C) INDIVIDUAL ISOLATE: - #A35                                        - -     (ix) FEATURE:                                                                   (A) NAME/KEY: CDS                                                              (B) LOCATION: 3..660                                                  - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:                               - - CC ATG GAC AAA TCT GAA TCA ACC AGT GCT GGT - # CGT AAC CGT CGA CGT             47                                                                           Met Asp Lys Ser Glu Ser Thr Ser Ala - #Gly Arg Asn Arg Arg Arg                 220               - #  225               - #  230                            - - CGT CCG CGT CGT GGT TCC CGC TCC GCC CTC TC - #C TCC GCG GAT GCT AAC            95                                                                        Arg Pro Arg Arg Gly Ser Arg Ser Ala Leu Se - #r Ser Ala Asp Ala Asn            235                 2 - #40                 2 - #45                 2 -       #50                                                                               - - TTT AGA GTC CTG TCG CAG CAG CTT TCG CGA CT - #T AAT AAG ACG TTA         GCA      143                                                                     Phe Arg Val Leu Ser Gln Gln Leu Ser Arg Le - #u Asn Lys Thr Leu Ala                           255  - #               260  - #               265               - - GCT GGT CGT CCA ACT ATT AAC CAC CCA ACC TT - #T GTA GGG AGT GAA CGC           191                                                                        Ala Gly Arg Pro Thr Ile Asn His Pro Thr Ph - #e Val Gly Ser Glu Arg                        270      - #           275      - #           280                   - - TGT AGA CCT GGG TAC ACG TTC ACA TCT ATT AC - #C CTA AAG CCA CCA AAA           239                                                                        Cys Arg Pro Gly Tyr Thr Phe Thr Ser Ile Th - #r Leu Lys Pro Pro Lys                    285          - #       290          - #       295                       - - ATA GAC CGT GGG TCT TAT TAC GGT AAA AGG TT - #G TTA CTA CCT GAT TCA           287                                                                        Ile Asp Arg Gly Ser Tyr Tyr Gly Lys Arg Le - #u Leu Leu Pro Asp Ser                300              - #   305              - #   310                           - - GTC ACA GAA TAT GAT AAG AAG CTT GTT TCG CG - #C ATT CAA ATT CGA GTT           335                                                                        Val Thr Glu Tyr Asp Lys Lys Leu Val Ser Ar - #g Ile Gln Ile Arg Val            315                 3 - #20                 3 - #25                 3 -       #30                                                                               - - AAT CCT TTG CCG AAA TTT GAT TCT ACC GTG TG - #G GTG ACA GTC CGT         AAA      383                                                                     Asn Pro Leu Pro Lys Phe Asp Ser Thr Val Tr - #p Val Thr Val Arg Lys                           335  - #               340  - #               345               - - GTT CCT GCC TCC TCG GAC TTA TCC GTT GCC GC - #C ATC TCT GCT ATG TTC           431                                                                        Val Pro Ala Ser Ser Asp Leu Ser Val Ala Al - #a Ile Ser Ala Met Phe                        350      - #           355      - #           360                   - - GCG GAC GGA GCC TCA CCG GTA CTG GTT TAT CA - #G TAT GCC GCA TCT GGA           479                                                                        Ala Asp Gly Ala Ser Pro Val Leu Val Tyr Gl - #n Tyr Ala Ala Ser Gly                    365          - #       370          - #       375                       - - GTC CAA GCC AAC AAC AAA CTG TTG TAT GAT CT - #T TCG GCG ATG CGC GCT           527                                                                        Val Gln Ala Asn Asn Lys Leu Leu Tyr Asp Le - #u Ser Ala Met Arg Ala                380              - #   385              - #   390                           - - GAT ATA GGT GAC ATG AGA AAG TAC GCC GTC CT - #C GTG TAT TCA AAA GAC           575                                                                        Asp Ile Gly Asp Met Arg Lys Tyr Ala Val Le - #u Val Tyr Ser Lys Asp            395                 4 - #00                 4 - #05                 4 -       #10                                                                               - - GAT GCG CTC GAG ACG GAC GAG CTA GTA CTT CA - #T GTT GAC ATC GAG         CAC      623                                                                     Asp Ala Leu Glu Thr Asp Glu Leu Val Leu Hi - #s Val Asp Ile Glu His                           415  - #               420  - #               425               - - CAA CGC ATT CCC ACG TCT GGA GTG CTC CCA GT - #C TGA T TCTGTGTTCC              670                                                                        Gln Arg Ile Pro Thr Ser Gly Val Leu Pro Va - #l  *                                         430      - #           435                                          - - CAGAACCCTC CCTCCGATCT CTGTGGCGGG AGCTGAGTTG GCAGTTCTGC TG -              #TAAACTGT    730                                                                  - - CTGAAGTCAC TAAACGTTTT ACGGTGAACG GGTTGTCCAT GG    - #                       - # 772                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:15:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 218 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:                               - - Met Asp Lys Ser Glu Ser Thr Ser Ala Gly Ar - #g Asn Arg Arg Arg Arg         1               5 - #                 10 - #                 15               - - Pro Arg Arg Gly Ser Arg Ser Ala Leu Ser Se - #r Ala Asp Ala Asn Phe                    20     - #             25     - #             30                   - - Arg Val Leu Ser Gln Gln Leu Ser Arg Leu As - #n Lys Thr Leu Ala Ala                35         - #         40         - #         45                       - - Gly Arg Pro Thr Ile Asn His Pro Thr Phe Va - #l Gly Ser Glu Arg Cys            50             - #     55             - #     60                           - - Arg Pro Gly Tyr Thr Phe Thr Ser Ile Thr Le - #u Lys Pro Pro Lys Ile        65                 - # 70                 - # 75                 - # 80        - - Asp Arg Gly Ser Tyr Tyr Gly Lys Arg Leu Le - #u Leu Pro Asp Ser Val                        85 - #                 90 - #                 95               - - Thr Glu Tyr Asp Lys Lys Leu Val Ser Arg Il - #e Gln Ile Arg Val Asn                   100      - #           105      - #           110                   - - Pro Leu Pro Lys Phe Asp Ser Thr Val Trp Va - #l Thr Val Arg Lys Val               115          - #       120          - #       125                       - - Pro Ala Ser Ser Asp Leu Ser Val Ala Ala Il - #e Ser Ala Met Phe Ala           130              - #   135              - #   140                           - - Asp Gly Ala Ser Pro Val Leu Val Tyr Gln Ty - #r Ala Ala Ser Gly Val       145                 1 - #50                 1 - #55                 1 -       #60                                                                               - - Gln Ala Asn Asn Lys Leu Leu Tyr Asp Leu Se - #r Ala Met Arg Ala         Asp                                                                                              165  - #               170  - #               175              - - Ile Gly Asp Met Arg Lys Tyr Ala Val Leu Va - #l Tyr Ser Lys Asp Asp                   180      - #           185      - #           190                   - - Ala Leu Glu Thr Asp Glu Leu Val Leu His Va - #l Asp Ile Glu His Gln               195          - #       200          - #       205                       - - Arg Ile Pro Thr Ser Gly Val Leu Pro Val                                       210              - #   215                                                __________________________________________________________________________ 

What is claimed is:
 1. An isolated and purified DNA molecule comprising DNA encoding the coat protein of the V27 strain of cucumber mosaic virus.
 2. The isolated and purified DNA molecule of claim 1 wherein the DNA molecule has the nucleotide sequence shown in FIG. 1 [SEQ ID NO:1].
 3. A vector comprising a chimeric expression cassette comprising the DNA molecule of claim 1, a promoter and a polyadenylation signal, wherein the promoter is operably linked to the DNA molecule, and the DNA molecule is operably linked to the polyadenylation signal.
 4. The vector of claim 3 wherein the promoter is the cauliflower mosaic virus 35S promoter.
 5. The vector of claim 4 wherein the polyadenylation signal is the polyadenylation signal of the cauliflower mosaic 35S gene.
 6. A bacterial cell comprising the vector of claim
 3. 7. The bacterial cell of claim 6 wherein the bacterial cell is selected from the group consisting of an Agrobacterium tumefaciens cell and an Agrobacterium rhizogenes cell.
 8. A transformed plant cell transformed with the vector of claim
 3. 9. The transformed plant cell of claim 8 wherein the promoter is cauliflower mosaic virus 35S promoter and the polyadenylation signal is the polyadenylation signal of the cauliflower mosaic 35S gene.
 10. A plant selected from the family Cucurbitaceae comprising a plurality of the transformed cells of claim
 8. 11. A plant selected from the family Solanaceae comprising a plurality of the transformed cells of claim
 8. 12. A method of preparing a cucumber mosaic viral resistant plant comprising:(a) transforming plant cells with a chimeric expression cassette comprising a promoter functional in plant cells operably linked to a DNA molecule that encodes a coat protein; wherein the DNA molecule is derived from cucumber mosaic virus strain V27; (b) regenerating the plant cells to provide a differentiated plant; and (c) identifying a transformed plant that expresses the cucumber mosaic virus coat protein at a level sufficient to render the plant resistant to infection by the cucumber mosaic virus strain.
 13. The method of claim 12 wherein the plant is a dicot.
 14. The method of claim 13 wherein the dicot is selected from the family Cucurbitaceae.
 15. The method of claim 13 wherein the dicot is selected from the family Solanaceae.
 16. A vector comprising a chimeric expression cassette comprising the DNA molecule of claim 1 and at least one chimeric expression cassette comprising a heterologous CMV coat protein gene which is not the coat protein gene of the V27 strain of CMV, a papaya ringspot virus coat protein gene, a zucchini yellow mosaic virus coat protein gene, or a watermelon mosaic virus-2 coat protein gene, wherein each expression cassette comprises a promoter and a polyadenylation signal, wherein the promoter is operably linked to the DNA molecule or coat protein gene, and the DNA molecule or coat protein gene is operably linked to the polyadenylation signal.
 17. A bacterial cell comprising the vector of claim
 16. 18. A transformed plant cell transformed with the vector of claim
 16. 19. The transformed plant cell of claim 18 wherein the promoter is cauliflower mosaic virus 35S promoter and the polyadenylation signal is the polyadenylation signal of the cauliflower mosaic 35S gene. 